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STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 
countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been 
suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with j predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These 
groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs. 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 
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to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
5 hospitals (Breiman et al, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

10 For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
15 of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
20 cases. An additional problem arises because it ahs been found that the capsular 

polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

25 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 



WO 00/06737 



3 



PCT/GB99/02451 



been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

5 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins. 

10 Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein. 

A protein or polypeptide of the present invention may be provided in substantially pure 
15 form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

20 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
25 antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of 
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eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses. 

5 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the honlologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
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Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

5 In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

10 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
15 derivatives thereof, the situation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
20 sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

25 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity /immunogenicity of the protein or polypeptide from 
which they are derived. 
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Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

any of the DNA sequences set out in Table 1 or their RNA equivalents; 

a sequence which is complementary to any of the sequences of (i); 

a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 



(i) 

(ii) 

(iii) 

(iv) 

(v) 



(ii) a sequence which is complementary to any of the sequences of (i); 
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(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
5 and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

10 The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein. _ Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 

15 substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

20 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

25 When comparing nucleic acid sequences for the purposes of detenriining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
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similar, segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 

degree of identity with any of the sequences described herein than with prior art nucleic 

t 

acid sequences. 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 
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Nucleic acid molecules of the present invention may be obtained from S.pneurhoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 



In the case of vaccines suitable. additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 
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It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann. Rev. Immunol, 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 

10 of detecting/diagnosing S.pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 

15 Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
20 to detect the antigens, and hence S.pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
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hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al, Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

Antibody fragments include, for example, Fab, F(ab') 2 and F v fragments. Fab 
fragments (These are discussed in Roitt et al [supra] ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining V h and V x regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 
peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 
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Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
5 framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison etalin PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
10 provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaiceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. 
15 Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S.pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

20 In addition, so-called "Affibqdies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al , ) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

25 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S.pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
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step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
Trhus, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S. pneumoniae. 

In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against S.pnewnoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

(b) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a protein or polypeptide of the 
invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a nucleic acid molecule as defined 
herein; 

(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
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or more proteins or polypeptides of the invention, or homologies, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or 
polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo. 

A suitable method for inactivating the protein is to effect selected gene knockouts, je 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
al, P.N.A.S., 94:13251-13256 (1997). 

In a final aspect the present invention provides the use of an agent capable of 
antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S.pneumoniae infection. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 
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Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

5 

The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the 
whole sequence has not been completed or published. On 21 st November 1997, the 

10 TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www@tigr.orp) , We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 

15 FastA and TfastA procedures (using the method of Pearson and Lipman (PNAS USA, 
85:2444-2448 (1988)). 

Using FastA and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
20 used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, Pap A, pep A, PsaA and PspA); 

25 (ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis; 



(iii) new hypothetical leader sequences derived from the searches in (i) and 

00; 
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(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 
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Usp45 



NanA 



PapA PsaA 



PspA 



PrtP 



A AA 1 1 

4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 4172.2 4147.2 



I 



4191.6 



4137.3 4193.1 4122.1 



4170.8 4175.3 



I i I 



4187.2 4188.1 4102.1 



4136.1 4127.1 4191.1 



1 



4119.1 



4119.2 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 
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Example 2: DNA vaccine trials 
pcDNA3.1+ as a DNA vaccine vector 
5 pcDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) 
(actually pcDNA3 . 1 + , the forward orientation was used in all cases but may be 
referred to as pcDNA3.1 here on). This vector has been widely and successfully 

10 employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et al. 9 Kurar and Splitter, Anderson et al). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNA3.1 contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in E. coli. This in turn allows 

15 rapid and efficient cloning and testing of many genes. The pcDNA3. 1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer which permits efficient, high-level expression of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 

20 range of cell types including both muscle and immune (antigen presenting) cells. 

This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generating a protective response in vivo, A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 

25 the sense orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G. and Brunham, R.G. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
30 Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L12 gene elicits immune response". Vaccine, 15, 1851-57. 

35 Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173. 

Preparation of DNA vaccines 

40 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNA3. 1 expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any part of the mature protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning and ultimately, expression in 
pcDNA3.1. 

15 

Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene.of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begin GCGGGATCCGCCACCATG followed by a small section of the 5' end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5* end in all cases (this site is 

40 TTGCGGCCGC). 



PCR primers 
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The following PCR primers were designed and used to amplify the truncated genes 
of interest. 



ID210 

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3' 

1 0 Reverse Primer 5 ' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 
3' 

4172,5 

15 Forward Primer 5' 

CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3' 
Reverse Primer 5" TTGCGGCCGCACCGTACTGGCTGCTGACT 3 ' 



20 



30 



ID211 

Forward Primer 5' 

CGGATCCGCC ACCATG AGTGAGATCA AAATTATTAACGC 3 ' 
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 



25 4197.4 



Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 
3' 

Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 
4122.1 



Forward Primer 5' 

CGCGGATCCGCCACCATGTCAC AAGAAAAAACAAAAAATGAA 3 ' 
35 Reverse Primer 5' TTGCGGCCGCATCGACGTAGTCTCCGCC 3 ' 

4126.7 

Forward Primer 5' 

40 CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 



WO 00/06737 PCT/GB99/02451 

21 

4188.11 



Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

5 

ID209 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTATTGATACGAAATGCAGGG 3 ' 
10 Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
15 Reverse Primer 5 ' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3 ' 

4170.4 

Forward Primer 5' 

20 CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3 1 

Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

25 Forward Primer 5' 

CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3 ' 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

30 

The insert along with the flanking features described above was amplified using PCR 
against a template of genomic DNA isolated from type 4 S. pneumoniae strain 1 1886 
obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 

35 pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing ~700 
base pairs through the 5' cloning junction of each large scale preparation of each 

40 construct. 



Strain validation 
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A strain of type 4 was used in cloning and challenge methods which is the strain 
from which the 5. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 S. pneumoniae strain NCTC 11886 was 
obtained from the National Collection of Type Strains. The ampoule was opened and 
the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37 °C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 
blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and 
incubated overnight in a C02 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

A standard inoculum of type 4 5. pneumoniae was prepared and frozen down by 
passaging a culture of pneumococcus lx through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10 9 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 

Streak pneumococcal culture and confirm identity 



Grow over-night culture from 4-5 colonies on plate above 

i 

V 

Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 

I 

V 

Grow over-night culture from animal passaged pneumococcus 
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I 

V 

5 Grow day culture (to pre-determined optical density) from over-night of animal 

passage and freeze down at -70 °C - This is standard minimum 

I 

V 

10 

Thaw one aliquot of standard inoculum to viable count 

I 

V 

15 

Use standard inoculum to determine effective dose (called Virulence Testing) 

I 

V 

20 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

25 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10 5 cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

30 S. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was immunised with recombinant pcDNA3.1 + plasmid DNA 

35 containing a specific target-gene sequence of interest. A total of 100 /xg of DNA in 
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 p\ in each leg). A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 

40 administered with non-recombinant pcDNA3. 1 + DNA (sham vaccinated) only, 

using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of S. pneumoniae 
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serotype 4 (strain NCTC 11886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal immunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 
5 calculations. A less obvious problem is that a certain amount of the inoculum for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations. However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of S. pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent further suffering. 

These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann- Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAA 
5 TTGTGTGGAAGAAAACGACAGAGAAATTCTCCGCCAAACTTCAAGAACAGATACAGGTCTATTTTCAAGAAGAAA 
TCACTCCCCTTCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTC AGCT AAAAACT 
TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTTTTCACCATAC 
CAAATATCAGAAAACACAGACAGACITTCAACAAGAAATCAAGGTTTACTACGCCGACGAACCTC 
TCAAAAGGG ACTGTATATG AACG AACGCTATCAAA ACTTGAAAGCTAAAG AATGTCAGGCGCi i 1 1 ATC TCCCCA 
10 AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAACCTGTCTTTGGGCAGATA^ 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCTCCTA 
AAATATAGTAAAATGAAATAA 

15 ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTC 
ATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCT^ 

CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAAGCTTT 
AATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCTGAGAAGAAAGAAGATGAA 
GCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAGCTTC 
20 AAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACA 
GGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATGCAAATTCTAAGG 

ACGT ATCTACGTG G AAAAAATT AG ATTTACCGT ATG A CTGG AGT ATCTTT AACG ATTTCG ATC ATG AATCTCCTG C 
ACAAAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCT 
CAAGAAAAATGTTCGCCTTACTTTTGATGGCGTCTACATGGATTCTCAAGTTTATC 

25 CATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGATGGTCGTGAGAAT GTGA 
TTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTT 
ACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACA 
TGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA 
TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGA 

30 ATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACTGTTTTA 

TACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTAC 
' ATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTCCATC 
CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 
TTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTTACTCGT 

35 TCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCTT^ 

CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCGAA AGAG GCAAAAA 

CAACCCTGCTATCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAG 

GTT AAACGTTTG GTT AAG GTT ATC AAGG ATGTTG AT A AG ACTCGCT ATGTT ACC ATGGG AG C AG ATAAATTCCGTT 

TCGGTAATGGTAGCX3GAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCr 

40 ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACG 
TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 
TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACnTTTGACCGTGACAACGCTGGCTATGC 
TGGACAGTTTATCTGGACAGGTACGGACrATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTT 
AAGAGCrCTTACTTrGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCT 

45 TTTCTGTTAAG AAG AAACCG ATGGTACACCTTCTTCCTCACTGG AACTG GG AAAACAAAG AATT AGC ATCC AAAG 

TAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCnTATTCGAATGCnTCTAGTGTAGAATTGTT 
ATCTCTTGGTCTTAAGACnTTCAATAAAAAACAAACCAGCGATGGGCGGACITACCAAGA^ 
TGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCnTGGAAGCAA 

GAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATT 
50 GCAGCAGATGGAAAAGACnTGACnTACATCTACrATGAAATTGTrGACAGCCAGGGGAATGTGGTTCCAACTGCT 
AATAATCTGGTTCGCITCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAA 
GAACGCTATAAGGCGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTG TCAA AT 
CAACTGAACAAGCAGGGAAATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACT 
TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGG 
55 CACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGA 

AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCGTGTAGA 

AGTGATTGCTCITAAATCAGAGCrACCAGTTGTGAAACGTATTGCrCCAAATACTGACTTG 

TCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAACT^ 

AAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCr 
60 TGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT 
TACTAGTCAAAAACCAATGCAATACCGCACTCTITGCTTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCT^ 
AAATGCAGCTGTTACAGTTCITCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATrCAGCCTAA^ 
TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACrr 
GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 
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GCTCTATTACCAGCTGATAAAGTAACCTTCTCTACAAGTGGTC 

GAGTTGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACT 
CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAACCA 
AGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTC 
5 CGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGACCTTGAAGCGCG 

CAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCAATCGCAGAAGCACCACAAT 
TACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAGGTTGCATGGGATGCGATTCGTCC 
AGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTrrAGAAGGTACGCAATTAACAACT 
TGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATTTCTGACCAATGGACCGGTTCAGAATTC 

10 TTTGCTTCAGACTCAAATCCAAGCGACCCAGTTTCAAATGTTAATGACAAGCTCATTTCCT 

CCAATCGTTGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTT 
GAGCAAACGCTCCGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTG 
ATTGAGTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGG 
TTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCT 

15 ACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTC 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGTTGACGG 
CAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCITGAGTCTGTAGATGGAAAAGTTCC^ 
ACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCG 
CGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAAGGATAAGAGCTT^ 

20 AACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGA 

TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGG^ 

GACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACT 

ACAGACAAACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAA 

ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTT 
25 GGACAAACTGGTCACCAACACCATCrrCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGA 
TTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCnTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGT 
TTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCAT 
CCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAAATC 
AACGTAAC ATTTAAAGCTATCAAAGCCAAAG CTATG AG ATGG CGTATGG AGCGTAAAGCAG ATAAG AG CGGTGTT 
30 GCGATGATTGAGATGACCTTCCrrGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCT 
GAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAA AGG 
CACnTGAAGAAAACAATCAAGTAGCITCAACTG^ 

CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 
AAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCITCCAAAAATCGAATTTGT^ 

35 GCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTT 
GGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAGTT 
CCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAA 
GCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCG 
GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCnTCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAA 

40 AAAGAAGACTAG 



4101.5 

ATGGATGCAATCTTTGACCTAATCGGAAAGGTTTTCAATCCCA 

45 TCATTTTGACAGTATTGGCTTTACTTTTTGGAGTGAAATTCTCCAAAGC 

CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGCTTTCTCAGCATCACTAGCAAAATTCGTTGAA 
AACACTGGTATCCAATTGAGTATTACCXSACGTTGGTTGGGCACCACTTGCTACAATCACTTGGGGT^ 
CACTATACTTCTTGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACACTTGATGT 
CGATATCnTTGATATCTGGCACTTGTCTATCACAGGTCTCTTGATTAAATGGTATGCTGATAACAATGGTGTGAGT 

50 CAAGGGGTTTCACTCTITATTGCTACAGCAGCTATCGTCCnTGTCGGTGTGTTGAAAATTATCAACT 

GAAACCTACATTTGATGACCTTCTTAACGCCCCAAGTTCATCACCAATGACATCAACTCACATG AACTA CATGATG 

AACCCAGTTATCATGGTTTTGGATAAGATTTTTGAAAAATTCTTCCCAGGCCTTGATAAATATGA 

CTAAATTGAACAAGAAAATCGGTTTCTGGGGATCTAAATTCTTCATCGGTTTC 

ATGGGAACTCCACATCCAATTGCAGGTGTTGCAGATGCAGATAAATGGCGTCTTGTTATCAAAGGATGGTTGTCTC 
55 TTGGTTTGACTGCCGGTGTATCTTTGGAACTCTTCTCACITATCGGTTCATGGTTCATCGCA 

TC AC AAGGT ATTACAAACGTTG CT ACT A A ACGTCTTC AAGG ACGT AA ATTC A AT ATCG GTCTTG ACTG G C C ATTC A 
TCGCTGGTCGTGCrGAAATCTGGGCirGTGCCAACGTACTTGCACCAATCATGTTGATTGAAGCAGTGClM'CITTC 
AAAAGTTGGAAATGGTATCnTGCCACTTGCAGGTATCATCGCrATGGGTGTTACrCCAGCTCTC^ 
CGTGGTAAATTGCTCCGTATG ATTATCHTCGG AACACrCITGTTGCCACTCTTCCrrCl' I T CAGGTACACTTATTGC 
60 ACCATTTGCAACAGAACTTGCTAAAGGTGTAGGTGCCnTCCCAGAAGGTGTGAGCCAAACrCAATTGATTACTCAC 
TCTACrCTTGAAGGACCAATCGAAAAACTTCTTGGTTC 
TCCTTGGTGCAGTAGTCITCCITGTATTCTATATCGGTATCnTT 
CG AAG AGT ACG C AG C AAAAGC AAAATAA 

65 4102.1 
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ATG AAG ATT ATG AAAAAAAAAT ATTGGACTTTAGCG ATATT ATTC rTTTGTT TG I " I'CAAT AATTCTGTT ACTGCTCA 
AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACTCAGACTAGCGAAAGTTITTCT 

ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATITCGTATTCAAATCGATAAGACAGA 

ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTC^ 

TATTGTTAACTCTGAAAGTAATAATTTACTAGGCGAAGA 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 

GATA TAGATAAAGTTACTCGTGGAGGCGAAAGTTATAAATTATATTCTAAAAGTAATTCTAAAGTTTCAATTG 

TT^AGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCA^ 

C AATAAAGG ATATTTAGG AA AAG AGG AGG GAG AGG AAGG AATA ATATC AG ATATTCAAG ATAG ATTAGGTCATG 

GTACGGCTCTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACGTTAATATTAACGTCTA 

TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGG ATTGTAAAAGCAA I ' l Tl ' l GATGCTGTAGATGATGGCAAT 

GATATTATCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTTTGAAA 

CATTTTTGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGT^ 

ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTATTAGTTCA 

AGTAGTTGATGTTCCAAGTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCAACATTATCTCTTTC 

AGAATTAGGACTTAATAACTTTATTAATGCAGAAAAATATAAAGAAGATTGGATrrm 

ATATACGTATCTTTATGGAAACTCATTTGCTGCTCCTAAAGTTTCTGGTGCGATTGCAATGACT 

AATTAAAAG ATC AG C CCT AT AATT AT ATGTTTGT AAAAAAATTCTGG AAG AAACATTACC AGT AA 

4106.1 

ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTGTTGTGCTTGTGGCCT 
GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGA 

ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTrCAAAGAAGCTGGAGTGGATGTTGATTTGAAATTGC 

CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCCAAGACTACAT^ 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 

TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 

ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAATAACGA 

CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGT^ 

GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACT 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAG 

GCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAAA 

AACGTGACTTTGTCATCGAATCTCAAAAATACnTGTCAAAAGAATACGCAAGCGACAAGGAAAAATGGGGTCAAT 

TTGACGCAGCTCGCTGGAATGCnTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACTTGACAG 

AAG G CTTC ACC AACG AATTTGTG AAAT AA 

4106.4 

ATG AT AAAAA ATCCT AAATT ATTAACC AAGTC rTTTTl ' AAG AAGTTTTGC AATTCT AGGTGGTGTTG GTCT AGTC A 
TrCATATAGCTATTTATTTGACCHTTCCTTTTrATTATATTCAACT^ 

AGTGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAACT 
ACCATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATG 
GAAAGCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATT 
TGTT CACGGGGTGGA T GTCTA CAA AGAAG CAAAGAATATTTTGCTTTTGTATCTCCCATAT ACA 1' ITl'l GGTTACA 
ATTGCnTTTTCCnTrGTTTTTTCrTATTTITA 

AGTAAAATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAA^ 
CAGATTAATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAAT 
TGCAAAATGAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCC 

CCTAGAGAATATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 

TGACCAGATGAGCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

CTTGACTGTTAAGCCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAAT^ 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTA 

GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 

ATACATGTGCGCCTCTAAGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGA 

TAAGGATGAAAGTTCCGGTTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGAT^ 

CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGGTCAATTTTCAATACTTTCCTTTCTATTACAA 

AGACGACCGTCAAGGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 

ATGACAAGGCTTTGGATAGTGGCTTAGAGCTGGCTATTCACTCAGAAGATGAAAATCTGCGTCTGTCTATCGGTGC 

AGCTACCAAGGGGAGAGATATTCGGAGCTTGTTriTGGAGAGTGCTGTTAAATACCAGATTTT 

TACCACCAACAGTTTTTAGCCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGG 

CTGGTTTAAATCAGATTT TGTCAG AATTTGATTTATCCATCCAAAATGGCCGTTGGCGAGGTCCAGAGCATCAGAT 
TCACTATTTCTATTTCTGTCTTTTCCGAAAGGTCTGGTCGAGTCAGGAATGGGAAGGTCACATGCAGAAACCAGAG 
AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGGTGCAAGTTTGTCTGCGGGGCAGAAATTGGACTTGGTT 
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CTCTGGCCTCACATCAGTCAACAACGTCTTCGGGTCAATGCTTGTCAGTTTCAAGTCATAGAAGAGAAAATGC^ 
GGGTATTTTGACAATATCTTTTATCTTCGTTTGCTGAGAAAGGTTCCGTCCrn^ 
AGGAGTTGAGGATGGTGAGATGATGATATTCTTCTCnTTTCTCCTATCTCATCGCATTCITCCT 
AGTATATTCTrGGTTTTGGAGGGCAGTTGGCAGATTTACTGACGCAATTGATTCAAGAAATGAAGAAGGAGGAAC 

5 TATTCGGGGATTATACAGAGGACCATGTCACCTATGAACTCAGTCAGC^ 

CTATATTTTACAGGATCGCTACAAGTACCAGTTAGAGAATCGT(^TCCATATTTACTGATGGAACATGATTTTAAA 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGCTTTTCAACAGGGGAC^GATTTA 

AATGGCTCCAGTTAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 
TTTTCTTGTCTTTTC AAGG ATGGC AG C C ATTTTG AAACGGT ATTTGG AATAC AATCGTTTTATT ACC ATTG AAG CTT 
10 ATG ACCCT AGTCG GC ATT ATG ATTTG CTGGTT ACC AATAACCCG ATTCAT AAG AAGG AA CAG ACACC AGTCT ATT A 

TTTAAAAAATGACITGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACTTAA 

4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCA 

GGTGGAGGAATCACAGGAGCrGGTGTAGCCTTGCAGGCGGCAGCTAGCGGTCTTGAGACTGGTTTGATTGAAATG 
CAAGACrTTGCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACnTCGTTACCrCAAACA^ 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCGCTCCACACATTCCAAAATCAG 
ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCnTTAGCCrCn^ 
GTACGACCTCTTGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGGTTTTGAGCA^ 
CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACA^ 
CGTGATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCTTCCT 
CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCrcrrGACAGACCAAGTGTTTGAAATCAAGGCC 
CGTCTGGTTATTAATACAACAGGTCCITGGAGTGATAAAGTACGTAATTTGTCn'AATAAGGGAACGCAATTCTCAC 
AAATGCGCCCAACTAAGGGAGTTCACnTGGTAGTAGATTCAAGCAAAATCAAGGTITCACAGCCAGTTTACTTCG 
ACAC^GGTTTGGGTGACGGTCGTATGGTCTTTGTTCTCCCACGTG 

AGACTACACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTCAACAA 

CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCrcrGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTTTGA 
CAACTTGATrGCGACTGTTGAATCrrTATCTCTCCAAAGAAAAAACACGTGAAGATGTTGAGTCT 
CnTGAAAGTAGCACATCTGAGAAAGATTTGGATCCATCTGCAGTTTCTCGTC 

ATGGTCTCTTGACTCTTGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGG 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTTTAAATTGATCAATTCTAA^ 

ATTGAACCCAGCAAATGTGGATTCAGAAATCGAAGCCTTTGCGCAACTTGGAGTATCACGTGGTTTGGATAGCAA 
GGAAGCrCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGCACITGCT 

GCG CC AGG ACTCAGCTTGG CAG AT ACTTTGTCCCTTC ACT ATGC A ATG CGCAATG AGTTG ACTCTTAG CCCAG TTG 
ACTTCCTTCITCGTCGTACCAATCACATGCTCnTTATGCGTGATAGCTTGGATAGTATCGTTGAGCCAAT^ 
GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCCTGCTGATGTCGAAGCAGCTCTCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

40 4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAATCCT 

TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGC 
TGCAGTCITTGTATCTGGCAAGCTCAGTCCAGCITATTTAAACCCAGCTGTGACCATCGGTGTGGC CTTAA AAGGT 
GGTTTGCCTTGGGCTTCCGTTTTGCCnTATATCTTAGCCCAGTTCGCAGGGGCC^ 
45 GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTA CTGGACCAG C 
CATCAAGG ATACTGTATCAAACTTG ATTAG CG AAATCCTTGG AACTTTTGTTTTG GTGTTG ACAATCTTTG CTTTGG 
GTCITTACGACriTCAGGCAGGTATCGGAACCrTTGCAGTGGGAACTTTGAT^ 

TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCrTGGACCTCGTATCATGCACAGCATCTTGCC^ATTCCA 
AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGCAGTG 
50 CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGACCTGGAGGAACTCATTCGTGACAAATCTTAATACACCTTTTATGATTGGCAATATTGAGATTCCCA 
ATTOTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCnTTCGTACTATCGCAAAGGAGCTCGGAGCTGG 

55 ACTCGTTGTAATGGAAATGGTCTCTGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 
CGATGAGGGCGAAAACCCTGTCTCTATCCAACTTTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATT 
C ATCCAAG AAAAC ACCAAG ACCG ATATCGTCG ATATC AACATGGG CTGCCCTGTCAAC AAAATCGTGAAG AACG A 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACTCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCTTGCAGTAGAAAATGCTCTCGCTGCTGAA 

60 CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 
GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACITTGAAACAGGAGAAATCCTACCTG ATTTG ACCTTTGAAGACAAGATGAAG AT 
CGAACACITGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 
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CTATCTCCGTGGAACATCTGGCGCTGCCAAACTCCGTGGAGCCATTTCGCAAGCCAGCACCCTG 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



5 

4107.1 

ATGACAAAGAAGAAAATTGAGCGTATTTCTGTAATACACCGAGAAAAGATTTTATGGCTCAAGTGGTA 
GAGATAAAGAACAACCTAAGTATAGTGTCCTTGAGCGTAAAATGTTTGATGCTGCTAAAAATCAAG ATATC 
10 CTTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGAG 
CTGTAAAAGAGGTTTATGTCTACAATCACATGAATGTTATCGGAGCTTGTCAGCGGATATTATTTATCAGTCAA 
A CCAGCTT ATG AT AAGTTAAATAAGTG GTTTAAT ATCT ATTCTG ATTTGTATTTT AG CGTTGT ACCCTTG CCC AAAA 
TGGGGGTATATCATGAGATGGTAGGTATCTAG 

15 4107.2 

ATGAAAAATTCCAACG AGGCTGAG ATG AAATTACTTTATACTG ATATTCGGACl' l'C I ' 1T G ACAGAAAT TCTAACAA 
G AG AG G C AG AAG AGCTAGTTG C AG CTG G C A AGCGG GTCTTCT ACATTG CCCCC AACTCTC^ 
ACGCGCCGTGCTGGAATACTTGTCCCAGCAGGC1 TClTnTCGATTACCGTCACGCGCTTTG CTCAA ATGGCTCGC 
TATCTGGTCTTGAATGATTTACCAGCTAAAACTACTCTTGATGATATCGGTCTTGGG 

20 TGCCGAACTCGATCCCAAGGACTrGCGTGTTTATGGCGCrATTAAGCAGGATCCTCAATTGATCCAGCAGTTAA^ 
GAGCTTTACCATGAGATGACCAAATCTCAGATGAGTTTTTTGGACTTGGAGAATTTAACAGATGAGGATAAGAGG 
GCGGATTTACTCTTGATTTTTGAGAAAGTAACAGCCTATCTTAATCAAGGTCAGTTAGCCCAGGAAAGTCAG 
CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATTTTAATCAAATCGCCTTG 
TACTCGTTTTTCTGCTGAGGAAGAGCGGGTTGTGGACTTACTTCACGGCAAAGGTGTTGAGATTGTTATCGGGGCT 

25 TATGCTAGTAAGAAAGCCTATACCAGTCCTTTTAGCGAGGGCAATCTCTACCAAGCCAGCGTAAAATT^ 

ATCTGGCTTCTAAATACCAAACGCCTGCTCAGGACTGTTCTCAAACTCATGAGAAGATGGATAGTTTTGA^ 
CTCTCGTTTGTTGGAGTCTTCTTATGACnTTTCAGAACTCGCTTTGGATGTCGATGA 

CAAATCTGGTCITGTITGACGCAAAAGGAGGAGTrGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACTCAGACCTGAGCTACAAGCATTTrCGTATTCTCTTGGGGGATGTAGCTTCTTACCAGTTATCT 
30 TTTTTGACCAGTATCAGATTCCnTTTTATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACT 
GAGTCTArrTTAGCTTTAAAACGTTACCGrrTTCGTCAGGAGGATTTGATTAA 

TGACCTCAGTCAGTCTGATATTGATGCrrrTGAGCAATATATCCGCTATCrrGGTATCAATGGCTTGCC 
AGCAAACCTTCACCAAATCCCACCATGGAAAATTTAATCTTGAGCGTTTGAATGTCCTCCGC 
ACCTCTTGAAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAAAACTCCTACAAAAATGGAG^ 
35 AGAAGGAGCTGTGACCAAGCAGTTACAAGATTTGACAACCACTT TGGA AGCTGTAGAACAGGAAAGACAAGCCG 
AAGTTTGGAAGGCTTTCTGCCATGTTTTAGAACAATTTGCGACTGTTTTTGCTG 
CTrCCTAGCCTTGCTCCATTCTGGAATGAGrrTGTCCCAATACCGTACCATTCCAGCAACAG 

GTGCAG AGTTACG ATTTG ATTGCACC ATTG ACTG CTG ACTTTGTCTATG CTATTGG ACT AACTC AG G ACAATTTAC 
CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAAGCGACCGAAGAAGGC 
40 GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAA^^^ 

GTAAGCAGTTGTTCTTGTCGGCTCCAAGCCTTTTTAACGAAAGTGAAAGTAA 

G ATCC ATTTTGG ATTTAGGCGG AG AG AG A AG AG G ATG AATCA C AAAGG ACTGTCT AAGG AGG ATATGGGG TCCTA 
TCACAGTCTTTTGTCTAGTCTGGTTGCCTATC ACCAGCAGGGTG AG ATG AG CGATACTGAGC AAG ATTTG AC I' 1 "1"! 
GTCAAGGTTCTGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 

45 CC AAG C AG C AAG ACCTT AGCC AAGG AC ACCTTGC AAG CTCT 

CrATCCTGCCAAACAGGAGTTTTACCTGTCTACGTCGGGTTTGACAGAGTTTTATCGCAATGAATACAGTTATTTC 
CTACGCTACGTTTTAGGCTTGCAGGAGGAATTACGTTTGCATCCTGATGCCCGTAGT^ 
GTATCTTTGAACGCGCCTTACAGTTGCCTAATGAAGATTCCTTTGACCAACGTCTAGAAC^ 
CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAGTTTACCAAGGAA 

50 GTTGCACGGACAACTGGACATATTCTCCGACACAATCC^GCCATCGAAACC^TCAAAGAAGAAGCAAATTTTGGT 
GGAAAAGACCAAGCCTrTATTCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGATTGAC 
CGTTTGAAAGCrAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCTGACrCAGTTCCAGTTTCCrC 
TTAATGGGCrCAATTCrCAGTTACCAACCTATCTTGCTGCCCTAAAAAGAGAAGGGGAGCAGAACTT^ 
CATGTACITGGAAATGGCTGAACCrGTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGA GCAG TGGTAGAAGC 

55 CAGCAAATCTATGAAATACCAAGGGCTCTTCTTGGAAAAAGAAAGCAGTTATTTAGGCGAATTT^ 

CAAGGCTAATCAACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCTrrACAAGAAA 

GAGAAGATTTTAGCAGGCCGGTTCGCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 

CAACATCAGGCTATTACAGGCTTTGAAGCCAATTACCATCTC^ 

CrGATGGCAAGCGTCTGGTCGGAGAAAAACrCAAGCAAGCTTGGCTTGAAAAAATAA 
60 GAGAGGAGTTGAATCGATGA 

4107.3 

ATG AAG CTT ATTCCCTTTTTAAGTG AGG AGG AG ATTC A AA AACTG C AAG A AGC AG AAGC AAATTCG AGC AAG G A A 
CAGAAGAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCrGCCCAGAATATCCTGGTCTCAGCATCGGCTGGT 
65 TCTGGAAAGACCTTTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTCGAAATTTCTCAACTCTTTA 
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TCTCAACCITTACCCTCAAGGCTGCAACT 

AAACAGATGATGTCXjACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAACGCTGCCAT TGGAA CCA 
TGGATTCirTCACACAAAAATTCCTTGGCAAACATGGTTATCTGCTTGATAT^ 

AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGTCnTTCATGAGGTCTTTGAAGCGCATTACCAAGGTAA 
5 AAAG AG ACCTTT AGTCATTTGCTG AAAAACTTTGCTGGG CGTGGCAAGG ACG AACG G GGTCT G CGCCAGCAGGTC 

TATAAAATCTATGACITCCTCCAATCCACCAGTAATCCTCAAAAGTGGCTGAGTGAATCT 
AGAAAGCTGATTTTACCAGTGAAAAAGAAAAACTGACCGA 

TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATTTAGAAAATG^ 
GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 
10 TCTAAGGAGAAAAACGGTCGAGCTCTGACTAATGCC^GCCGTAAGGCTGATTTGAAGCCCCrrGGCT^ 

AACGAAGAGAGAAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGA T 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTGGAAACTAGCTA 

GTAGAGGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATTACACCATTGAG 
ATTTTAGAGAATTTCCCACAAGTTCGTGAGTCTTATCAGGAGCGCTTCCATGAAG TCATG GTCGATGAGTATCAGG 
15 ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGGGAGATATCAA 
GCAATCCATCTATCGTTTCOjTCACKjCAGACCCGCAGATTTTCAATGAGAAATTCCAA^^ 

CAAGAAGGC^GGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAG CAACC AATGATGTC 
TTTGAACGTCTCATGGACC^GAGGTCGGCGAAATCAACT^^ 

AACTGACTCCCAATCCAGACAACAAGGCAGCATTTCTCCTCTACGACAAGGACGATACAGGTGAGGAAGAAG 
20 GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATCAAGAAAAAG 
GTGTTGCCTTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCTGTCTGA 
GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATG CTAGA 
CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGGTTGCCCTTATGAAGTCT 
ATGAGGATGAGCTAGCACGTTTGTCCCnTCAGAAAGCAGAGGATAAACrrCCACGAAAATCTCTA 
25 TCAATGCACAAAAAATGGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACT AAAGCAATTC ATGG ATATCCT AGCTTCTTGG CG CTTGTATG CC AAAACCC A CTCTCTCT ATG ACTTG AT 

TTGGAAGATTTACAACGACCGTTTTTATTATGACTATGTTGGGGCTTTGCCGAATGGTCCTG 

CrCTATGCCCTAGCACTGCGTGCrGATCAATTTGAAAAGAGCAATTTCAAAGCriTrc 

TTGACCAAGTCTTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 
30 TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATTTCAACAAGCA 
AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATGGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 
GCAGTAGAAGACCACTATCCTAAAACCATCAAACTCTCCATTCCTAGTCTGACCTATAGGCAGAACGAAGAGGAA 
TTACAGCTAGCAAGCTATTCTGAGCAGATGCGTTTGCTGTATGTTGCTATGACGCGGGCTGAGAAAAAGCTCTAT^ 
TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGCCAAAAATGGGAAACTAAATAGCA 
35 ATACTAGACTGCAAGCACGGAATTTCCAAGATTGGCTTTGGGCTATCAGTAAAGTGTTTACTAAGGAC 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAGCTATCGGAGAGTTGGAAACCAAGAGTCCTCT 
CCAAGATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCTCTGGAAATGCTGAAGGAGGTGGA 
AGTTTATAATACTCTTCACCGCGCAGCTATTGAACTTCCTAGTGTTCAAACCCCAAGTCAAATCAAG 
GAACCAGTTATGGATATGGAAGGTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCTTCGAT 
40 TTGCCAGATITTTC^ACCAAAGAAAAGGTAACTGGAGCTGAGATTGGTAGTGCTACTCACGAACTCATGCAGAGA 
ATTGACCTCAGCCAGCAACTAACCCTTGCTAGCCTAACAGAAACACTCAAACAAGTTCAAACTAGCCAAGCTGTC 
AGAGACAAGATCAATCTTGATAAAATTCTTGCTTTCTTTGACACAGTACTCGGTC^GGAAATTCTTGCT 
ACCATCTTTATCGCGAGCAACCTTTCTCCATGCTCAAACGAGA 

TATCCTTGATGGCTATCTGCTTTACGAAAACAAAATTGTTCTGTTCGACTACAAGACAGACCGCT 
45 AGTCAACTCGTAGACCGCTATCGTGGTCAGTTAGCTCrATACGAAGAGGCTTTATCACGAGCCTATTCGATTGAAA 
ATATTG AAAAAT ACTTG ATTTTACTCG GT AAAG ACG AGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACTTGCTCGCCATGCTGAAAGCTTGGGAGTAGATGCTATTGCAACGATTCCACCAATTTAT^ 
50 C^GAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCITTGACTCCAAGCCTTTACACAGAAATGTTGAAAA^ 

G AAG AACTCHTCT ATG CC AGTTC AAG AT ATCC AAACCTTTGTC AG CCTTG GTGG AG AAG ACC ATATCGTCTTTAAT 
GGTCCTGATGAGCAGTTCCrAGGAGGACGCCrCATGGGGGCTAGGGCTGGTATCGGTGGTACTTATGGTGCTATGC 
CAGAACTCnTCTTGAAACTCAATCAGTTGATrGCGGATAAGGACCTAGAAACAGCGCGTGAATTGCAGTATGCTAT 
55 CAACGCAATCATTGGTAAACTCACrTCrGCTCATGGAAATATGTACGGTGTCATCAAAGAAGTC^ 
GAAGGCTTGAATATTGGATCTGTTCGTTCACCATTGACACCAGTGACnX} 
GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 



60 ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATTTTTAAAAAi' I ' n'CTl'CCCAATCCTGATCTACCAAT 

TTGCCAATTATTCTCCCTCITTTGTTGATACTGCAATGACAGGTCAATACAACACT 
ATGGCAACCAGTATCTGGAATCCnTTCnTTACATTTCTAACAGGGATTGTGTCAGCCrr 
CCATCTTGGTCGAGGCAAAAAGGAAGAAGTTGCGTCTGATITTrACCAATTrATTTAT^ 

GTGGTCTTGCTGGGGATGGTAClUUHVlM'GGCACC^ATAATCTrGAATCATATTGGGTTAGAAGCAGCAGTAGCGG 
65 CAGTAGCGGTTCGCTATCrTTGGTTTTTATCTATCGGGATTATCCCCTTGTTGCTCTr^ 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



35 



PCT/GB99/02451 



TGGATTCGCTGGGCTTGACCAAACTCTCCATGTACCTCATGCT 

CTCTTGATTTACGGTGCCTrrGGTGTrCCAGAACTGGGAGGGGCTGGTC 

GGGTCTTGCTTGGGATTTCTGTTCTGGTTTTATTTAAACAGGAGAAGCT 

TCCACTTAATATGGATAAAATTAAGGAAGGAGTTCGTTTAGGTCn'GCCTATTGGGGGAACT 

GCTATCTTTTCAGTGGTTGGCTTGATTATGGCTAAGTTTTCGCCCTTGATTATAGCTAGTCAC^ 

CTTTTCAAGTCTTATGTACGCCTTTCCTATGAGTATCTCATCGGCTATGGCTATTGTC GTTTCCT 

CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCrAGGAAGATGGACTGCCCTCATriTTGC 

AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTTATGGTAACGACCCAAAATTTATCG^ 

CGTTTTTTAACTTATAGTCTTTTCTTCCAGTTAGC 

TAAGGATACAGTTATTCCnTTTACCTTGGTTTGCTTGG 

4112-2 

ATGAGTACITTAGCAAAAATAGAAGCGCTCTTGTTTGTAGCGGGTGAAGATGGGATTCGGGTCCGCCAGTTAGCT 

GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAG GACC CA 

GATTCC^GTTTGGCTTTGATTGAGACAAGTGGTGCTTATAGATTGGTGACCAAGCCTCAATTTGCAGA^ 

AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCTGCCCTTGAGACCTTGTCCATTATTGCCT 

ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 

TTTTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCTATGTGACTACGGATTA^ 

CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCTTGAGATTCAAGCCCAAGAAAG 

CCAATT ATTTGGTG AAAGG AT AG AAG AA G ATG AG AATC AAT AA 



ATGGATACGATGATTAGTAGATTTTTTCGCCATTTATTTGAAGCCTTAAAAAGTTTGAAACGA^ 
CAGTAGCTGCTGTCAGTTCAGTCATGATTACTTTGACCTTGGT 

AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATG TGGA AGATAATAGTCAG 

ACAATTG AAAAAG AAG GTCAAACTGTTAC AAATAATG ACT ACCAC AAGGTAT ATG ATTCTTTG AAG AACATGTCT 

ACGGTTAAAAGTGTTACCTTTTCAAGTAAAGAAGAACAATATGAAAAATTAACCGAGATAATGGGAGATAACTGG 

AAAATCTTTGAAGGAGATGCCAATCCTCTCTATGATGCCTATATTGTAGAGGCAAACACTCCAAATGATGTAAAA 

ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATACAGAAAGACTC 

TTCAAGTTAGCTrCATTTATCCGTGTTTGGGGACTAGGGATTGCTG 

TTCAAATACCATTCGTATTACCATTATTTCCCGCAGTCX»CGAAATTCAAATCATGC^ 

AGTTATATCCGTGGACCGTTCTTGTTAGAAGGAGCCTTTATCGGTTTATTGGGAGCTATCGCACCATCT 

CTTTATTGTTTATCAAATTGTTTACCAATCTGTCAACAAATCGTTGGT 

G ATTT ATTT AGTCCGTTG ATG ATTG CCCT ACT ATTTGTG ATTGG G GTTTTC ATTGGTTC ATTGGG ATC AG G AAT ATC 
CATGCGCCGATTCTTGAAGATTTAG 



ATGAAGAAAGTAAGATTTA l , ' l , ' ri l 'l- l AGCTCTGCTAl-rri T C II AGCTAGTCCAGACGGTG CAATGG CTAGTGATG 

GTACITGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCA^ 

ATCAATCnTGGTTCTATATAAAAGCAGATGCTAACrATGCTGAAAATGAATGGCTAAAGCAAGG 

TTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCMl'lUATTATCTTGACCAA 

GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCrATGTTGGTGCAACAGGTGCCAAAGTAAT^ 

TGGGTCTATGATTCTCAATACGATGCITGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 

CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGAT TAAT ^ 

TG AATG CTAGTG GTGCCAAAGTACAG CAAGGTTGGCTTTTTG AC AAAC AATA CCAATCTTGGTTTT AC ATC AAAG A 

AAATGGAAACTATGCTGATAAAGAATGGATTITCGAGAATGGTCACTATTATTATCTAAAATCC 

GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTITATCTCAAATITGATGGGAAAATGGCT 

TGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGAC^ 

GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATT 

GTCAAG CTTGGTACTACTTCAAATCCGGTGGTTACATG ACAG CC AATG AATGG ATTTGGG AT AAGG AATCTTGGTT 

TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCrCATAGTCAAGCTTGGTACT 

AAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGA 

GGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTG 

AAAAGCnTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGC GCTTG G 

CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCT 

CCCnTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGT 

CITTCTGATATGGAAGTAGGCAAGAAATATTATfCGGCAGATC 

CCTTCCTTTrCAAAGArrTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATAAG 

CATTAACAATAGCCnTTTGGAGAACAAGGGCGCTACITTrAAGGAAGCCGAAGAACATTACCATATCAATGCT 
TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAA TAAi i in i IG 
GCATTACAGCCTATGATACGACCCCTTACCITrCTGCTAAGACATTTGATGATGTGGATAAGGGAA 
^AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCrTGGAAACAAGGCTrCTGGTATGAATG^ 
GGAATATGCTTCAGACCCITATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCT 
CAAAGATTAG 
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ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCTTTTGTG^ 
ATGTGCCGATGTTTCAAGGAGCCTTTTACAGTTTTACCAACTGGACA 
CTTAAACAACTTTAAGCTCCTCTTC^TGGATCCAAAATTCATGAATGCGATTG 
5 GCCATGGTGGTTGGTGAGATTGCACTCXjGGATCnTCATTGCGCGTGTCTTGAATTCTAAAATCAA^ 
TCTTCCGTGCITGGTTCTTCrTCCCAGCTGTTTTATCTTO 

T ACGGTCTTCC AGCG ATTGG AAATGCCCTTC AT ATTG AATTTTTCC AAACCAGTCTTTTAGGG ACT G G AG 

CAATCTTTGCGGCTGTCTTTGTCCTTCTTTGGCAAGGGGTGGCTATGCCCATCATCATCT 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTTTCT GGAA CATTGAA 

10 TTGCCTTACITGCTACCAAGTGTCTCTATGGTCTTTATC 

CTTTGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTG^ 

AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCnTGTTCTTCTTAATTGTAGTGATTTCGA 
GAGAGTATCTAAGAAATTTGAAATTTAA 



ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCT 

TGCCGCTCCTTGCTACCCTCTTT AGTTCCTTC AAACCCACTAAGG ATATTGTAG AT AA'l" I" l'C 1 " 1 "1 GGCTTTCC AACC 
AAClTCACATGGGACAACTTTAGCCGTCTCnTAGCTGATGGGATTGGAGGCTATTATTGGAACTCTGTCGTCATCA 

CTGTCTTGTCTTTACTTGCAGTAATGATCT^ 
20 AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCCTACCTTTCCAAGTCATCATGATT^ 
CGGTTATGATGAGTAAACTCGGTTTGGCTAATACCITTGGTTTGATCTTGCTCTACTTG 

ACCCTCTTTCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGATGGGG 
CTAATCAATTTACAACCTATTTCCGCATCATCTTCCCAATGATGAAACCGATGCATGCGACAACCATGATCATCAA 
TGCCCTTTGGTTCTGGAATGACTTCATGTTGCCACTCCTTGTCTTGAACCGGGATTCCAA^ 
25 TtnTCCAATACAACTACGGAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCTTACGTGCT 

C AGT ATC ACC ATTGTCTATCTCTTCTTCC AACG CC AT ATCATTTC AGG AATG AG C AACG G G G C AGTG AAGTAA 

4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTGClU'ClM'l'l'lC'1'1 AG CTATAGTACT GTTATATCCA 
30 TTCTTGCACAAAATTGGATGGGTCTTGTGGCTTCAGTAGGAATGTTTCTATTTACTA'r 
TCGATTTTATCCCATAAATTCTTTCGATTGATTTTGCAGT^ 

AGTTTAGAACATTTCCAAATTGTGAAGAAATTTAACTATGCl-I'riCri TCACCCAATATGCAGGTGTG GCATC AGA 
ACCGGG C AG AAGTG AC C ITC II 1 AATCCTAATTATT ATGG AATT ATTTGTTGTTTCTGT ATT ATG ATTG CTTTCT AT 
CTGTTTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGGCTTTGT^ 
35 CTTTACTCAAAATCGAACTGCCTTTCCTGCTATTATCGCTGGAGCAATTAT CTATCT CTrrACGACTA 
GGAAGGCCTTTTGGCnTAGTATTGGGGTCTTCGCGATTGGTTTGAGTTTCCTCTTT^ 
ATGGGTACTTTAGACTCTTCTATGGAAGAACGCATTTCTATCTGGGATGCnX3GGATGGCCT^ 
CTTTTTGGGGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATG CT 

CAGTCTTTATATTG AT ACG ATTCTG AGTTACGG AATTGTGGGG ACTATTITATT AGTT TTGTCT TCTG TTG CTCCTG 
40 TTCGCITGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCTTTATCTATCTTTCCTT 
GGTTGCTGTGCACGGAATTTTTGACITGGCTCTCTTCTGGATTCAGTCAGGCT^ 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

45 ATGTCAAAGATGGATGTTCAGAAAATCATTGCACCGATGATGAAGTTTGTGAATATGCGTGGCATTATAGCTCTAA 
AAGATGGGATGTTAGCAATTTTGCCATTGACAGTAGTTGGTAGTTTGrrCITGATrATGGGACAA 
AGGATTAAATAAGAGCATTGCTAGTGTTTTTGGAGCTAATTGGACAGAGCCGTTTATGCAAGTATATTC AG 
TTTGCTATTATGGGTCTAATTTCHTGTTTrrCAATTGCCTATTCTTATGCT 
AGCTGGAGTTCTATCTGTATCTGCATTCTTTATTTTGCTAAGATCATCITATATCCCT/^ 

50 GGGACGCTATTAGTAAAGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG 
TATTTATACCTTCTTrATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCTATTGCCAAACAG 
TTTG AAGCA ATG ATTCCAGCATTTGT AA'IT 1" I CI' fATCTTCT ATG ATTGTATATATTTT AGCG AAG TCAT TG ACTAA 
TGGCGGAACATTCATAGAAATGATTTATTCTGCTATTCAAGTTCCGTTGC^^ 
GCTATTGGAATTGCATTCTTTATATCATTTTTGTGGTGGTTTGCTG 

55 GACAGCTCTGCTTTTATCTAATCITGATGCTAATAAAGCTATGTTAGCCTCTGCTAAT^ 

GCACATATTGTTACTCAACAATTTTTAGATTCATITTTAATTCTATCAGGTTCAGGG ATTAC GTT^ 
TGCCATGCTTTTTGCAGCAAAATCAAAACAATACCAAGCCTTAGGAAAAGTTGCAG CTTTT C 
GTAAATGAGCCAGTTGTATTTGGATTTCCGATTGTCATGAATCCAGTTATGTTrGTACCTTTCATT 
ACITGCAGCTGTGATAGTATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCTTGG 

60 AGTACACCAGCTATTTTATCAGGATTTTTGGTGGGTGGATGGCAAGGAGTTATTACTCAGCTGGTGATATTAGCGA 
TGTCTACATTGGTrrATTTTCCATTCITTAAAGTACAGGATCGTTTAGC^ 

4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
65 CGGTCATGGAGCTTCAGGTAAATCAACCITTGCACAGGAATTGTACCAAGCrTTAGATTCTACTACA^ 
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CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTGGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 
GA 

5 

4122.1 

ATGAAGAAAAGATACCTAGTCTTGACAGCTITGCTAGCCTTGAGTCTAGCAGCTTGTTCACAAGAAAAAA 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 

TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCATCXjT 

10 AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAA 
ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAGTGGTTTTAGAA 
ACCAAGCTCTATCAAGATTATGTCAACCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTAT 
AGCGAACACCAGACAGGCTTGGCCnTTGATGTGATTGGGACTGATGGTGATTTGGTGACAGAAGAAAAAGCAGCC 
CAATGGCTCTTGGATCATGCAGCTGATTATGGCTTTGTTGTC 

15 ATATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGTTTGG 
AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCATTTCAAAAGTCGT^ 
20 TCGTCTATACTTCGAAAGAAATTTATTACCTTTCACAATCTGACTTTGGTAT^ 
CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAG 

CAAGTTITCAAATAACCGAGTGGCGCTTAAATAAACAAGGAATTCCAGTATTTAAGCTATCAAATCATCAATTTAT 
AGCTGCGGACAAACGATTTTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCTGA 
CTTTAAACTGTACAATAGTCCITATGATTTAAAAGAAGTGAAATCATCCTTATC 

25 GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCTGGATC 

TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCTTT 
ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCGTTTTGA 
AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCnTTATCAGTTAGATACGACTGT 
ATCIXjCAGTCAATGATTTTCCAGGTTCTTATAAACCAGAGGGAAGTGGTAGTCTTCCTAAAAAAGAAGA^ 

30 AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCTGATAATGTAGCrCATAAT 

TACATTTCAAACCAATCTGATGCCACATTCAAATCC^GATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 
AAAAATTG ATTTCTTCT AAG ATG G CCG G G AAGTTT ATG G AAG CT ATTT AT AATC AAAATG G ATTTGTG CT AG AGTC 

TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTGTTAAAGTAG 

CTCATAAAATTGGAGATGCGGATGAATTTAAGCATGATACGGGTGTTGTCTATGCAGATTCTCCATTTATTCTTTCT 

35 ATTTTCACTAAGAATTCTGATTATGATACGATTTCTAAGATAGCCA 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATCCITT^ 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
40 CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTGGTGTCTATAAAAATCCT 
AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCAGCACT 
ATTCTTCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGTAACTGTTAAGC 
ATGAAAGTTCAAGTGGTATATGGATTAATCTACTCGTATCCATTGTGCCATTTGGAATTCTATTCTTCTTCCT 
TCTATGATGGGAAATATGGGAGGAGGCAATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 
45 AATAAAGAAGATATTAAAGTAAGATTTTCAGATGTTGCTGGAGCTGAGGAAGAAAAACAAGAACTAG 
GTTQAGTTCTTAAAAGATCCAAAACGATTCACAAAACTTGGAGCCCGTATC 

CTCCGGGGACAGGTAAAACTTTGCTTGCTAAGGCAGTCGCTGGAGAAGCAGGTGTTCCATTCnTTAG 
TTCTXiACTTTGTAGAAATGTTTGTCGGAGTTGGAGCTAGTCGTGTTC 

GCACCAGCTATCATCTTTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGGTAATG 
50 ACGAACGTGAAGAAACCTTGAACCAACITTTGATTGAGATGGATGGTTTTGAGGGAA 

TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCITTTGCGTCCAGGACGTTTTGAT^ 
TGGTCGTCCrGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 
CTTTAGCAGAAGATGTTGATTTGAAATTAGTGGCTCAACAAACTCCAGGCTTTGTTGGTGCT 
CrTGAATGAAGCAGCTTTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATGCn^ 
55 GATAGAGTTATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTGCT 

GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 
GCCGTGCAGGCGGATACATGATTGCAGTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 
Aj\TTGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCITTAATGTCCAAACCACAGGAGC^ 
ACTTTGAACAAGCGACACAAATGGCACCTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACTTGGCCCAGTAC 
60 AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATTTCAGAACAAACAGCTTATGAAA 
TTGATGAAGAGGTTCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTGAAAC 
TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCTTTACGAAAC 
AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 
CGAAAAATAA 

65 
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4125.10 

ATGAGGGAACCAGATTTTTTAAATCATT^ 
TTTCTGGTGGATTAGATTCCATGTTTCTATTTAAGGTATTGT 

CTAGCTCATGTGAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAATTAAGGA AGTT GGCTGCTGAA 
5 GCAGAGCrrCCTATTTATATCAGCAATTTTTCAGGAGAATTTTCAGAAGCG 

TTTTTCAAGAGGTCATGAAAAAGACAGGTGCGACAGCTTTAGTCACTGCCCACCATGCTGATGATCAGGTGGAAA 
CGATTTTTATGCGCnTGATTCGAGGAACTCGCTTGCGCTATCTATCAGGAATt 
GATAGAAATC^TTCGTCCCTTCITGCATTTTCAGAAAAAAGACTTTC 
AATCAGGAGAATCATTATTTTCGAAATCGTATTCGAAATTCTtACTTACCAGAAT^^ 
1 0 TTAGGGATGCAATCTTAGGCATTGGCAATGAAATTTTAGATTATGATTTGGCAATA 
TAATGTGGAAGATTTACAGCAGTTATTTTCITACTCTGAGTCT^ 
GTTTTCCAGATTTGAATCTTACAAAAGCTCAGTTTGCTGAAGTTCAGCAGATT^ 

TCATCCGATTAAAAATGGCTATGAATTGATAAAAGAGTACCAACAGTTTCAGATTTGTAAAAT CAGTCCGCA GGCT 
GATGAAAAGGAAGATGAACTTGTGTTACACTATCAAAATCAGGTAGCTTATCAAGGATATTTATTTTCT^ 
15 TTCCATTAGAAGGTGAATTAATTCAACAAATACCTGTTTCACGTGAAACATCCATACACATTCGTCATCGAAAAAC 

AGGAGATGTTTTGATTAAAAATGGGCATAGAAAAAAACTCAGACGTTTATTTATTG 

ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATTGAGCAATTT 

AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAAC^CTGTACTTTATATAGAAAAAATAGA 
TAGGTAA 

20 

4126.1 

ATGAAGCGTTCTTCTCTTTTAGTTAGAATGGTTATTTCCAT 

ACTrTCTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 
ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGACGGATGTTC 

25 TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA 
TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTG TGCA GATGAA 
AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTTTGACT 
GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTG 
TGCTTCGTTTGG AT A ' 1 ' 1 TCTI ' ATG AAACTCTGG A AG CCTATCTC AATC AACTCC AGTTG GGG C AGCAG G G CTTTG C 

30 CTTCATTATCAATGAAAACCATGAATrrGTCTACCATCCTCAACACACAGTTTATAGTTCGTCT 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
ACCTTGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCTTAGTGTGGTTCAGTCTTAAACGCT 
TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCT^ 

35 GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 
GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCTATAA 
CACTTTGGACACCATCATCTCGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 
ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATC ATGTC CGCCAGT 
ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAA™ 

40 AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAG 

GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 
CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 
AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTG 
AT AAAT A G AAT AG AAACT A G CT AA 

45 

4126.7 

ATG AAGCGTTCTTCT L * IT 1U ' AGTTAG AATGGTTATTTCCATCnTTCTGGTCTTTCTCATTCTCCTAGCTCTGGTTGG A 
ACTITCTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGC 

50 ACTAGCCACTTTATTCAGTCITATATCAAAAAACT^ 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTITTGACCATCTTGAAGTCAGA 
TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTC^TTTCTACAGATGACAGTG TGCA GATGAA 
AACTTCCTCTG ATATG ATGG CTG AGG ATTGGTACCAAA AGG CCATTCATCAGGG AGCTATG CCTGTTTTG ACTCCA 
GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAAT CrTGG TG 

55 TGCnTCGTrTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACTCCA 

CTTCATTATCAATGAAAACGATGAATTTGTCTACCATCCTCAACACACAGTITATAGTTCGTCTAGCAAAATGGAG 
GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
ACCTTGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCTC 

60 TCCirrGAAGGATTTGAGAGAAACCATGTTGGAAATrGCITCTGGTGCTCAAAA 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCT 
GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTrTCGAGCCAGATrAATCCACATTTCCTCTATAA 
CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 
ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTrcATTTGTCTCTCTGACGAAATCAA 

65 ATCTCnTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCTT^ 
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AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGA^ 
GGGCCATATTAAACTTTCT.GTCCAGAAACAGGATTCGGGATTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTCGGGGAGTTGGTCTTCAAAATGTCGATCAACG 
AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAA 
5 AT AAATAG AAT AG AAACTAGCTAA 

4127.4 

ATGTTTTTTAAATTATTAAGAGAAGCTCTTAAAGTCAAGCAGGTTC ATCGT 

TTTGGTCTTTCGTATCGGAACTAGCATTACAGTTCCTGGTG^ 
10 CCTTCnTAAACATGTTGAGCTTGGTGTCGGGGAATGCCCTAAAAAACTTTT 

CTATATCACCGCTTCTATTGTTGTCCAACTCTTGCAAATGGATATTTTACCCAAGTTTGTA 

GGGGAAGTAGGTCGAAGAAAATTGAATGAAGCTACTCGTTATATTGCTCTAGT^ 

GGATTACAGCTGGTTTTAATACCTTGGCTGGAGCTCAA 

GATTGGTATCATCTTAACAGCTGGTAGTATGATTGTCACTTGGTTGGGTC 
15 AACGGTGTTTCCATGATTATCTTTGCCGGGATTGTTTCCTCAATTCCAGAGATGATTCAGGGCATCTA 

ACTTTGTGAACGTCCCAAGTAGCCGTATCACTTCATCTATCATTTTCGTAATCATTTTG 

ATTATTTACTTTACAACTTATGTTCAACAAGCAGAATACAAAATTCCAATCCAATA 

CTCCATCTAGCTCTTACCXrCCGTTAAAAGTAAACCCTGCTGGAGTT^ 

GCAGCGCCTGCGGCTATTCTTCAGTTTTTGAGTGCCACAGGTCATGATTGGGCTTGGGTAAGGGTAGCACAAGAGA 
20 TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGCTTTGTTGATTATTCT 

TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCTATATCCATGGAGTTCGTCCTGGTAA 
AGGTACAGAAGAATATATGTCTAAACTTCrrCGTCGTCTTGCAACTGTTGGTTCCCTCTTCCTT 



25 4127.5 

ATGGATATTAGACAAGTTACTGAAACCATCGCCATGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 
GGGATTTCrCTTTTGGACIXn*ATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCTAT 
AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTX5AGTTGGGAATTCCTATCGTTAATAA 
GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACTACGTGGTTCTGGCAAAAGCGCTTGATAA 
30 GGCTGCGAAAGAGATTGGTGTGGACnTTATTGGTGGTTTTTCTGCCTTAGTACAAAAAGGTTATC 
GAGATTCTCATCAATTCCATTCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCT 

CCAAGTCTGGTATTAATATGACGGCTGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAA ATCTT TCAGATA 
TGGGAGTGGCCAAGTTGGTTGTATTCGCTAATGCTGTTGAGGACAATCCATTTATGGCG GGTG CCrTTCATGGTGT 
TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTTGTGAAACGTGCTTTGGAAAAAG 

35 GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAA 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACTTGAGTTTGGCACCAACCCCTGCGGTTGG 
AGACTCTGTGGCACGTGTCCTTGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCrrG CCTTG GC 
CCTTCTTGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTCGGTGGTTTATCTGGTGCCTTTATC 
CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCTCTCTTAATTTAGAAAAACTA 

40 CGGCT ATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCTGAAACTATTGCGGCTAT^ 

TGCGGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 

CGATATGATTGAGTTTGCTGGTCTATTAGGAACTGCACCCGTTATGAAGGTTAATGGGGCTTCGTCTGTC 

ATCTCTCGCGGTGGACAAATCCCAGCACCAATTCATAGTTTTAAAAATTAA 

45 4128.1 

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATTGCAGGGGCAGTTGGCTGAAAAGACTGCAAAATTA 
AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATtTTGGTTGGGGACAATCCAGCC^ 

CGCAACAAGGAGAGGTCAGCCCTTGCGGCTGGTTTCCGTAGCGAAGTAGTACGGGTTCCAGAGACCATTACTCAA 
GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCITGGCATGGGATTTTGGTTCAGTTG 

50 CAAAACACATTGATGAAGAGGCGGTTGTATTGGCTATTGACCCAGAAAAGGATGTGGATGGTTTCCATCCTCTAA 
ACATGGGGCGTCTITTGGTCTGGTCATCCAGTCATGATTCCTTCGACACCGGCAGGAATTATGGAAATGTTCCATGA 
ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATAT TGTCG GAAAACCTATGGCCCA 
G CllCUUU ' l GGCAAAGAATGCAACAGTAACCTTGACTCACTCACGTACrCATAATCTTTCCAAGGTGGCTGCAAAA 
GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACTTTGTCAAACCAGGTGCGGTAGTCA 

55 TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACT^ 

TAGCCACATTACGCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 
GCACTTAGGACATTGGATAGAAAATAA 

4128.2 

60 ATGTCTAAATTTAATCGTATTCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAACT 
TTGTCAATGCAGGGGTTCCAGATGGAGCTTCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATGTCCC 
AAAC ATGGCT AAAATAGGTCTTGG A AATATTCCTCGTG AAACTCCTCTT AAG ACTGTAGC AG CTG AAAGC AATCC 
AACTGGATATGCAACAAAATTAGAGGAAGTATCTCTTGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 
ACTCAACATTACTGAGCCTTTCGATACTTTCTGGAACGGATTCCCAGAAGAAATCCTGACAA AAAT CG 

65 TGAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTATTCAGGAACGGCTGTTATCTATGATTTTGGACCACGTC 
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AGATGGAAACTGGAGAGTTGATTATCTATACTTCAGCT^ 

TCCTTTGGATGAATTGTACCGTATCTGTGAATACGCTCGTTCGATTACCCTTGAGCGTCCTGCCCTTCTTGGTCGCA 
TCATTGCTCGCCCTTATGTAGGTGAACCAGGTAACTTCACTCGTACGGCAAACCGTCGTGACITGGCTGTATCTCC 
ATTTTTCCCAACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACTTATGCTGTG 

TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGA TACAC TATTGAAG 
ACTATGGGACTTGCnX3AGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTC 

GTCGTAATGCTCACGGTTACCGTGATTGCITGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCTATGAG 
AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGGATCACACTCGGGA 
ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTCCAGT^ 
CAGCGACTGTTGCCGATAACTTTGGTGTGGAAACTGCTATC^ 

4129.2 

ATGTTTATTTCCATCAGTGCTGGAATTGTGACATTTTTACTAACT 

TAGAAAGGCGCAAATTACAGGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
CAATGGGAGGTTTGGin'l'lCl'lGATTACTTCTGTTTTGGTTGCl'l TCTnTlCQCCCTATTT AGTA GCCAATT CAGCA 
ATAATGTGGG AATGAT TTTG 1 ' 1'CATCTTGGTCTTGTATGGCTTGGTCGG A' 1U' AG ATG ACHTTCTC AAGGT CTTT 
CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCnTCAGCT 

TCTATGAGCGCGGTGGCGATATCCTGTCTGTCTTTGGTTATCCAGTTCATTTGGGATTTTTCTATA1U 
CTTTTCTGGCTAGTCGGTTTTTCAAACGCAGTAAACTTGACAGACGGTGTTC 

GATTAGTTTGTCTGCCTATGGAGTTATTGCCTATGTGCAAGGTCAGATGGATATTCTTCTAGTGATTCT^ 
TTGGTGGTTTGCTCGGTTTCTTCATCTTTAACCATAAGCCTGCCAAGGTCTTTATGGGTGA 

CTAGGTGGGATGCTGGCAGCTATCTCTATGGCTCTCCACCAAGAATGGACTCTCTTGATTATCGGAATTGTGTATG 
TTITrGAAACAACTTCTGTTATGATGCAAGTCAGTTATTTCAAACTGACAGGTGGTA^ 

GCCTGTACATCACCATTITGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCnTGGAGCGAGTGGAAGGTTGACTTC 
TTCTTTTGGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAATTTTATATTTGATGTAA 

4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAACTTTTTGCA 
GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCT^ 
CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTATTrCCAGTGTTATTTCAAAAAGCA 
GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACT^ 

GTCCATCTTCGCTGGGAAAGAGATGATAGGACTTTrGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCTTTGGTAGGCGGATCGATTGTTCTCnTAGGTTTAATC 

AATC C ACGTCTGCCTCTCTATGTTAGTTTTTT ATCC AATG CCTTG AAT ATT C I ' l ' lTl ' l C AAGTCT AG CT ATTTTTGTT 

CTGGATATGGGGATAGCTGGTGTTGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCrrGTO 

AATTAAAACTGCCTTATGGGAAGCCAACTTTTGGTTTAGATAAGGAACTGTTG 

AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCGUUlVriU'rGGGACGGAGGCAGTTGCT 

GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACG^ 

TGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAA^ 

GTTCCTCATGTTGCCCCTGTCCTTTAGTATATATGTCTTGGGTGTACCATTAACTCATCTCTATACGACTGATTCT 
TAG CG GTGG AGGCTAGTGTTCT AGTG ACACTGTTTTCACTACTTGGG ACCCCTATG ACG ACAGG AACAGTCATCTA 
TACGG C AGTCTGGCAGGG ATTAGG AAATG C ACG CCTCCCTTTTTATGCG A CAAGTATAGG AATGTGGTGTATCCGC 
ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCT 
ATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCITGGCTTGCAGCACTTACTAGCCATGTACT 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCTACCTGATTTCTACAGA 

TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACTCCAACTCAACAAATACtTTGGGATTGGACTCCCAGTCGT^ 

TGGAGTTGCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTTTGGTGCC 

CTTATCGCATCTGGGATTTACGTGGTTCTTGTTTCAGGCATCTTCTGAAAAGTAGCCAATCT 

AACAGOATCTGTTATTACCACGATTGGTTTAACCTTGATCCCTC^ 

GAGCCAACTGGTCAAAGTCTCTTGCTTGCAGCTATTACTGTTCT^ 

GATTTATCAAGTCTATCTCTATTTTGATTGGTCTC 

iTCTCTCCTGTTGCGGTAGCTCCACTTGTCCATGTCCCAACT^ 

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACnXJGTGTTTATCTGGCCTTGTCT 
TCACAAAGGATCCAATCGACAGCACGCGGCTTGGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 
GAATCTTTAACACCTTCCCTTACACCGGATTTTCACAAAACGTTGGTTTGGTTAAATTGTCAGGCATCA^ 
CCTGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCTTGGACTGCTTCCTAAGTTTG 

TTCCAAGCTCCGTCCTCGGTGGTGCCATGCTGGTAATGTTTGGTTTTGTATCAATTCAAGGGATGCAAATCCTCGC 
CCGTGTTGACTTTGCTAACAATGAACACAACrrCCTTATCGCAGCTGTTTCAATCGCTGCAGGTGTCGGTCTCAAC 
AACAGTAATCTCTTTGTCAGCATGCCGACAGCCTTCCAAATGTTCTTCTCAAACGGAATCGTCGTAGCCAGCCTAC 
TCGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAA 

4136.2 
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ATGAAAGATAGAATAAAAGAATATTTACAAGA 

AAAGACAGTTCCAAGGATTTTCXrrGAGTTGATTAAAACCITGTCCI^ 

AAG AAG ATGGT AGTCTG ACATTAG AA ATTAAG AAAAAACATG AG ATTACCCTCAAGGGG A r i; r I 1 CATG CCCAT A 
AAAATGGCTTTGGCTTTGTTAGTCTGGAAGGCGAGGAGGACGACCTT 
5 TATTGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA 

AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTC^ 

GCTGGCTATATTCGTTCAAAAAATCAGAAAATCAGTCAACCGATTTATGTTAAGAA ACCA GCCCTAAAATTA 

GG AACAG AAGTTCTCAA AGTCTTT ATCG ATAAATACCCAAG C AAG AAACATG A I" I" ! CTTTGTCGCG AGTGTTCTCG 

ATGTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCTTGAGGTCTTGGAATCAATGGACATTGTATCCGAGTT 

10 TCCAGAAGCimTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTCTGGA 
TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 
TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGATGTTTCHTATTATGTGACCGAGGGGTCT^ 
GACAAGGAAGCCCTTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGCTTCCAGAACGACTA 
TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

1 5 GGAGATTGATAAACATGGTCGTGTGGTG\ACTATACCATTAC^CAAACAGTTATCAAGACCAGTTTTCGTATGACC 
TATAGCGATCTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAA TTGTA TCAAGTATC 
GAACTCATGGCCAAGCTTCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAGCTCTCA^ 
AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCT 
TGATTGAGTCTTTTATGTrGATGGCTAATGAAACAGTTGCCGAACATTTCAGCAAGTTG 

20 CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTTTATTC 

ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACnTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 
ATGTATTGTCCATGATGCTTCTTCGCrCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGGACT 
AGCTGCTGACTATTATACTCACTTTACCAGTCCAATTCGTCGTTATCCAGACCTTCT^ 

ATTACGGCCGTTCTAAGGAAATAGCAGAGCATTTTGAACAAGTGATTCCAGAGATTGCGACCCAGTCTTCCAACC 
25 GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAAGTATTGTCAAATTCGGTCTCTTTGTCGAATTGCCAAACACAGTTGAAGG 
CTTGATTCACATCACTAATCTGCCTGAATTTTATCATTTCAATGAGCGTGATTTGACT 

GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCTTGAAACAGTCTAGTCGT 
30 AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGA CAAGAG AAAATCAGGACGCTCAAATGA 
TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCnTTTTACAAGGAAGTAGCTAAGAAAG 
GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 



ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTC 
CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTG 
GTATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCT GGTA GCT 
TTTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTA 
TAGACATCGATACGGCATATGATTACrrGGATAAACGTTTTAATTCTAAAGCACTTCGTAT^ 

GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAGGAA 
TTGACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTC TTATA 
TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACT 
TAAAGGTGGCTTTGGTGCAGT AGCAG AAACATTAGCAAACGGG AAATTCCTTGCnXjCAAATG AAAAAC ITT ICG A 
TCCTAACITGCITTCAAACTCCATCTTTTTAATTGT^ 

TCAAGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACITAATAAGATGTTGTTCA 
TTGTCACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCnTGTACGTATT 

ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCAC 
AGGTTTGATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACT 

TGGACATTGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 
CTCTAGCAGTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATA 
TTCAATAGTTTC^TGGGACITCrrACTTGGTCTACITGGTGGTC 

TAAACAAGGTGCnTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACITCCTTCCTCCAACA 
GCTGTTAGCTACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTATCT 
TGGAAATAAAGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGA 
AGTTCGTCACTAA 

4138.1 

ATG AAATTT AGT AAAAAATAT AT AG C AG CTG G ATC AG CTGTT ATCGTATCCTTG AGTCT ATGTG CCT ATG C A CT AA 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 
60 GTGAAAACnTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAA ATTAC AG 
ATCAGGGCTATGTAACGTCACACGGTGACC^CTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCnTTAG 
TGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTTA 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAAGAT 
GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACrCTAATGTTGCTGTAGCAAGGTCr 
65 CAGGGACGATATACGACAAATGATGGTTATGTCnTTAATCCAGCTGAT ATT ATCG AAGATACGGGTAATGCTT ATA 
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TCGTTCCTCATGGAGGTCACTAfCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGC 

ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCT AGTGAC AATAACACGCAATC 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCTATGA 

TTCACXTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCnXjCTAAGATTATCAGT 

AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTGCCT 

TTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATG 

TCTAGGCAGTCTTTCAAGCAATCC ITCITC I'ri'AACG ACAAGTAAGGAGCTCTCTTCAGCATCTG ATGGTTATATTT 

TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCC^ 

AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCnTCTCCATCTCTTCCAATCAAT 

CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 

TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAG 

GTGCGCAAAAACATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCTTTTGATTGGGCTGGATCAa 

TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCTACCTGCAAAATCGAGG 
TGCAGCCITTTCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCT 

GGTATTTACATAAACACATGGAGGACTCATTCTGGATGGTCTTGGGTTTGACTCTAATAATCGCGGGTC 
AAACTTTATTGACAGGGTCAGTCAGGGCTTTGTTGTGGATATGTTCCACCITGACTTT 

ATGTG G C AG AT AG CT ATCTG ACG GTTGG AGTG ATT ATTTT ATTG ATTG CAATG CT AAA AG AGG AAATAAATGG AA 
ATTAA 

4139.5 

ATGAATAC AAATCTTG CAAGTTTTATCGTTGG ACTG ATCATCG ATG AAAACGACCGTTTTTACTTTGTG CAAAAGG 

ATGGTCAAACCTATGCTCTTGCTAAGGAAGAAGGCCAACATACAGTAGGGGATACGGTCAAAGGTTTTGCATACA 

CGGATATGAAGCAAAAACTCCGCCTGACAACCTTAGAAGTGACTGCCACTCAGGACCAATTTGGTTGGGGACGTG 

TCACAGAGGTTCGTAAGGACTTGGGTGTCTITTGfGGATACAGGCCTTCCTGACAAGGAAATCGTT 

TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCn'ACATCCGTCTTGAAGTGGATAAG 

AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACTTCCAACGTCTTGCTCGTCCrGCCTAC 

AACCAAAACTGGCCAGCCATTGTTTACCGTCTCAAGCTGTCAGGAACTTTTGTI^ 

TTGGTTrrATTCATCCTAGCGAGCGTTACGCAGAGCCACGTT^^ 

CCGTGAAGTCGACCGCACTCTGAACCTCrCCCTCAAACCACGCTCCTTTGAAATGTrGGA 

ATTTTGACTTATrrGGAAAGCAATGGCGGTTTCATGACCTTAAATGACAAGTCATCTCCAGACGACATCAAGGC 
CCTTTGG C ATTTCTAAAGGTC AGTTC AAG A AAG CTTT AGGTGGTCTT ATG AAGG CTG GT AAAATC AAG C AGG ACC A 
GTTTGGGACAGAGTTGATTTAG 

4139.8 

ATGAAAGATGTTAGTCTATTTTTATTGAAAAAAGTITTCAAAAGCCGCTTAAACT 
ATCTGTACTCGGTGTTACCTTTTATTTAAATAGTCAGACTGCAAACTCACACAGC^ 

CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCTGATACCAGC TCGGA G 
GAATACCAGTTTGCTAAAAATAATTTAGACGTGCAAAAAAATCTTTTC 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTGTATCAA 
ATGACCCGACTGCTAGCCCTGGCTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTACCAAGCCCTGTATCCCT 
TGAACATAAAAGCACATACTTTGGAGTITCCGACCCACGGGATTGATCAGATTGTCTGGATTTTAGAGGTTATCAT 
CCCAAGTTTGTTTGTG GTTG CT ATT ATTTTT ATGCT AAC AC AACT ATTTG C AG AAAG AT ATC AAAATC ATCTGG AC 
ACAGCTCACITATATCCTGTTTCAAAAGTGACATTTGCAATATCCTCTCTTGGAGTTGGAGTGGGATAT^ 
TGCTGTTTATCGG AATCTGTGGCri" 1 TLTITI CT AGTGGG AAGTCTG ATAAGTGGTTTTGG AC AGTTAGATT ATCCC 
TACCCAATTTATAGCTT AGTG AATCAAGAAGT AACT ATTGGGAAAATACAAGATGTATTATTTCCTGGCT^ 
TAGCnTClTAGCCITTATCGTCATTGTGGAAGTTGTGTA 

CTCTTTCn^rCACTCATTGGGATTGTTGGCnTATTGTTTGGTATCCAAACCATTCAGCCT 
TCTGATTCCCITTACrTACrTGCCnTCAGTGGAGATTTTATCT^ 

T AAATTGG AGC ATGGG AATGGTCTTACTTCCTTG CCTG ATTATCITTTTGCTATTGGG A ATTCTATTT ATTG AAAG A 
TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1 

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCT ATTTT AGTAGCTTTAATTGCAATCTTAGGATTAGT^ 
TCAGAAGAAGAAATTACCTCATATTATTAAAGGTGGAATTAAGACCTrrc 

G CAGG AATTGT AC AAAATTCTTT AAATCC ATTTGGT ACC ATGTTTG AG C ATG C ITTl CATTT ATCTGG CGTTGTGCC 
GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAG^ 
GTGTTC AAT ATCTT A ATCG CTCGTTTTACTCG ATTT AAAT AT A n " H " H " 1* AAC AGGGC ACCACACTCT AT AT ATGG C 
ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCTTTACTAGCTTGCCTCTCATCITACT 

GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGACAAGGTAGCTTT 

AGGTCATTTCAGTTCITTGGGATATTGGTTGAGTGGTTTTACTGGTAGCCTT 

GAGGACATTAAATTTCCAAAGAGTTTAGCTTTT^ 

TTACATTATTGTAGCTATCTTTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT 
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TATGtnTTACAATTAGCAGGTCAATTTGCAGCAGGGGTATTT 
AATTGTTCC^GCCITTAAAGGTATTTCAGAGCGTCTTG 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAGTTTTGTTGGTGGTT^ 
ATTGCTTGAGGAACGGTTGTTATCTTACCAGGTGTTGTGCCTCATTTCTTCT 
5 G AATGCATCTG GTGGTGTTCGTGG AGCCACTATTGG AGC ATTTTT ACAAGGTATTTTAATC ACT I" 1 ' 1TC 1 'I CCAGTCT 

TTTTAATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACTTTCTCAGATGCAG 
ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGTC^ 
TGGAGTATCCTTTATTAAAAAGCC ATCTG CAACG G AGG AATAA 

10 4142.3 

ATG ATT AAAACATTTCTCTCTG CCCTTTCGGTC ATTCTCTTTTCTATCCCT ATC AT AACTT ATTC 11 T IT TCCC ATCT 

TCTAATCTTAACATTTGGCTATCTACCCAACCTATCTTGGCACAGATTTATGCCT^ 

GGCTGCTATTTTAAGlTlUlTATITnTITCCTATCTTTTT^ 

TTTTGCTCTTACT ATCGCTC AT ATTACT ATT ATTCGG AAC AG ATAAAACCCI TTCT1 CTGCA TC AAAT AAG A CT AA A 
1 5 ACCrTAAAATTAGTAACTTGGAACGTCGCTAATCAAATAGAAGCACAACATATrGAGCGAAI-m 

ACGCCGATATGGCTATATrCCCrGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 

TGTITCATCAAGTTGGACTTTCTATGGCCAACTATGATATTrrCACTrCTCCACCTACCAATAGTC 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAACT^ 

TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGGTTTAAT 
20 GGAAATCTGGAAGCAAGACTTAAACATCATTCATAATCAATTGGCTTCAAAATATCCAAAGGCT 

GATTTTAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCTCATAGGGACGCATTAA ATGC ACT 

CCTTTTGAAAGAGGAACTrGGAATAGCCAAAGTCCAAAACTTTTTAA 

AAAACCACTACTATGTTAAAGATTTAGACATTGTAAGTTTTCAAAACTCTC 

CACATTTTAA 

25 

4142.4 

ATGAATCCAATCCAAAGATCnTGGGCTTATGTCAGCAGAAAGCGACTGAGAAGTTTTA 

TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAGTCCAACAuAAACAGTAGAAAGCAATCTTTAT 
ATCACTQ^TACATCTTTTTCT ATTAAGAAGATAGAGAATGGTCAGACATTCAAGTTGTCAGACCTAGCATCT 

30 AGC AAG ATT AAGGG GCTGG AAAATGTCTCTCCTG AACTTG AG ACGGTCG C AAAACTAAAAG ACA AGG AAGC AGTG 

ACTGGCGAGCAGAGCGTGGAGCGTGATGATTTATCAGCTGCAGACAATAACnTGGTTAGCTTAACGGCTCTTGAG 
GATTCATCCAAGGATGTAACCrTTACCAGTTCGGCITTCAATCTAAAAGAAGGGCGACACCT^ 
TCCAAGAAAATCCTTATCCACGAAGAATTGGCTAAGAAGAACGGT(nTTCGCTTC ATGAC AAGATTG 
CTGGTCAGTCTGAATCTGGAAAAGGACAAACAGTAGAGTTTGAGATTATCGGCATCTTTTCTGGT^ 

3 5 AGAAATTCACAGGCTTGTCTTCTG ACTTCAGTG AAAATCAAGTCTTTACAGACTATG AAAGTAGCCAAACCCTTTT 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCnTCrATGTAGAAAATCCTAAGGAAATGGACGGACTCATGAA 
GCAGGTAGAAAACTTGGCCnTGGAAAATCAAGGCTACCAAGTCGAAAAGGAAAACAAGGCTTTT 
AGACTCAGTTGCAACITTCCAAACCTTCCTGACCATCTTCCnTrATGGGATGTTGATAGCAGGAGCT 
ATTCTGGTTTTGTCTCTCTGGTTGAGAGAACGGGTCTATGAAGTGGGGATTTTACT^ 

40 GCTCGATCTTCCTACAATTCTGTTTAGAGGTAGTTTTGGTATCTCT 

AACG C AATC ACAACTT ACCT ACTCC AAACTCT A CT AGC A AGTGG AG ATCAGGC AAG CTTAC AAG AT A C ACT AG CC 
AAAGC A AG C AGTTTATCAACT AGC ATCTT ATCTTTTG CAG A ATCCT ATGTTTTTCT AGTTCTG CTT AGTTG CTT ATC 
TGTAGCCCTTTGTTTCCTATTCTTATTTAGAAAATCACCGAAAGAAATTTT 

45 4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTTTTCAAATCGATTGTCATCITCCTG 
GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACTGCCAAGG 

ACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 
GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCTTATGTCAAACGTATCAACGCTAT CGGA GATTTG 

50 ACTGGATATGACCTGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 
AGCTTGATGATTACAGGTGTCAATGACrCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAG 
GAGAGCACTTAACCAACGACGACAAGGATAAAATCCTCITGCACAAGGACTTGGCAGCCAAACACGGCTG 
GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGA 
AGTGACAATCAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACAC 

55 AGCTATTACAGACATTC^CACTGCrGCAAAACrTTATGGATACACAGAAGACACAGCCATTTATGGGGACGCAAC 
C T T CT1 ' I GTAACAGCAG AC AAG AACTTGG ATG ATGTTATG AAAG AGTTG AATGGCATCAGTGGTATC AACTGG AA 
GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCITGAGCAATCTATCTCTGGTATGT^ 
AACCTCCTCTTCTG GGGTAGCITG AGCTTCTC AGTTCTC CTCCTTG CCCTCTTG CTCAGCCTTTG G ATC AACG CCCG 
TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 

60 ATTCATCACCGAATCrATCTTGATTGCTATCCCTGCTCTAGTTTCTGCTTACT^ 

CAATTGGAAACACTGTCCTTGCCAATGTGACTTCAGGTGrrGCCAAACAGGCrAGTAAGGCGGCTC 
ACCnTGGTGGTGGTGCAGAAGTAGATGGCTTTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACATCAGA 
CTTTATCATCA'I'riTrGTCCTTGCCTTGGTTCTAGTGGTTCTCGTTATGGCGCITGCT 
AAC A ACC AAA AG AG CTCTTG CTGG ATGGTG AATAA 
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4144 1 

ATGTCACAGGATAAACAAATGAAAGCTGTTTCTCCCCTTCTGCAGCGAGTTATCAATATCT 

GGGTTGGGAGTTTGATTTTCTGTATTTGGGCTTATCAGGCTG 
ATCCAGCAGGCAGGCATCTGGGGTCCACCTCTCTTTATCTTTTTACAGATTTTAC 
5 AGGGGCCTTGACCTCXiGTGGCTGGGGTCTTTATCTACGGGCACATCATCGGGACrATCTACAACT 
GTGATTGGCTGTGCCATTATCTTTTATCTAGTGCGCCTATACGGAGCnXiCCriTGT CCAGTCT 
GCACCTACGACAAGTACATCG ACTGGCTAG ATAAGGGCAATCGTTTTGACCGCTTCTTTA inn ATG ATG ATTTG 
GCCCATTAGCCCAGCTGACTTTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAG CGCTAC ATGACCATC 
ATCATTCTGACCAAACCCTTTACCCTCGTGGTTTATACCTACGGTCnXi ACCTATATTATTG AC" l"l " IT fCTGGC AAAT 
10 GCTTTGA 

4144.2 

ATG AG AAATATGTGGGTTGTAATCAAGG AAACCTATCTTCGACATGTCG AGTCATGG AG'l'ri'Cl" rCTI I ATGGTGA 

TTTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTTCTTCT 
15 GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAAA 
GACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATAGT 
GTTCTAAAGGCAGTTTATC ATGGCG AAACATCG CTTG AAAATG G AATTAAATTTG AGGTT ACAGGTACACTC AATG 
AACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCTCAAGAGCAGGAAAAACGCnTAGCGCAGACAA 
AATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTTAGGAT 
20 TCTTTCTTTATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAAT 
TATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTAT^ 

TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTGCCGTTTTGCTCTTTA AAGAT TTGCCA TTCTT 
TCTGGTATTTTGGATCACITCGGAGATCCTATCTC 

CGTAGTCTTGGCAGCCnTCCTAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGCCTTGTCGCCTTTGAT^ 
25 ATTTTGATTATGGGTGGTTITTTTGGAGTGACAGCTCrAGGTGCAGCTGGTGACAATCT 
TTATATTCCCTTTATTrCGACCTTCTTTATGCCGTTTCGAACGATTAA 

ATTTCACTTGCTATTACAGTGATTriTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCT 
TCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG 

30 4144.3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAATTCCCCAAGTA 

GACTTAAATGAGATTTTGACAGGAGCCGAGATGGCATCATXrTTGGAAG 

TACGAAGTCAAGAGAAGAAAGATGCCITGTATGAATTGGTAOT 

TCTCTTTGTCGG AG ATTTG AACCG AG C AG AAAAG G G AGCCCG ACTTC AT ACCG AC ACCTTCCAACCC CAAGGTGT 
35 GGAAGGTCTCTTGATTAGTTCGGTCGATGCAGCTCrTGCTGGACAAAACGCCTTGTTGGC 

TATGGTGGTGTGArrATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTAACCTACCTG 
CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA 
ATGTTGTCnTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCrGACT 
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA 
40 CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG 

4146.1 

ATGTTAAAACrTATTGCTATTGTTGGAACAAATTCAAAACGTTCTACAAACCGTCAATTGCT^ 
AACACrTTACTGACAAAGCTGAAATrGAACITGTTGAAATCAAGGCCATTCCT 
45 GCAAGTACCTGCTGAAATATTGGAAATTGCrGCTAAAATCGAAGAGGCAGATG 
GTATGATCACrCTATTCCAGCTGTTTTGATGAGCGCTCTTGCTrGCnTGTCT^ 
AACCAATCATGATTACAGGTGCTTCTTACGGTACGCTTGGTC^ 

GAATGCTCCTGAAATCAAGGCAAATGTTCTTCCAGATGAATTCTTGCTCTCACACTCTCT^ 
AGTGGCGAC1TGGTTGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTT 
50 TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTGAA 

4146.2 

ATG AATACCTATCAATTAAATA ATGG AGTAG AAATTCCAGTATTGGG ATTTGGAAUl 1 I I AAGGCTAAGG ATGG A 
GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTTGAAGGCTGGTTATC^ 

55 ATGAAGAAAGTGTTGGTCAAGCAATCAAAGATAGCGGAGTTCCACGTGAAGAAATGTTCGTAACTACCAAGCTTT 
GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCrTTGGAAAAATCTATAGAAAAACTGGGCITGG 
TGGATTTGTATITGATTCATTGGCCGAACCCAAAACCGCTCAGAGAAAATGACGCATGGAAAA CTCGCA ATGCGG 
AAGTTTGGAGAGCGATGGAAGACCTCTATCAAGAAGGGAAAATCCGTGCTATCGGCGTTAGCAA'll'lUCri'CCCC 
ATCATTTGGATGCCTTGCTTGAAACTGCAACTATCGTTCCnXjCGGTCAATCAAGTTCGCT TGGCG CCAGGTC 

60 TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATrGGAAGCTTGGGGGCCTTrrGGACAAGGAGA 
ACTGTirGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTTGGCCTG 
GAGCTTGGCAGAAGGATTTTTACCACITCCAAAATCTGTCACAACCTCTCGTATTCAAGCT 

GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 
GATGTGGATTTCTAG 
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4147.1 

ATGATCTGCAAAATGCTTGATCCAATTGCTATTCAACTAGGACCC 

TGACAGGCTTGATTCTTGCGGTTTATTTGACCATGAAAGAAGCACCTAGAAAGAAGA TCATA CCAGAC^ 
AGATTTTATCTTAGTAGCCTTTCCCTTC^ 

TAGTCAGAATTTAGGAGAGATTTTTGCCATTTGGAATGGTGGTTTGGCCATTTACGGTGGTTTG 

CTTGTGCTCTATATCTTTX5CTGACCGTAAACTCATCAATACTTGGGATTTTCT 

GATTGCTCAAAGTTTGGGGCGTTGGGGTAAT1TCT1TAACCAAGAAGCT 

TATCTACCTGGCTTTATCCGTGACCAGATGTATATTGAGGGGAGCrACCGTCAACCGACTTTCCTTTATGA 
TATGGAATCTGCTTGGCTTTGCCTTGATTCTGATTTTTAGACGGAA^ 

GGCCTTTTACTTGATTTGGTATGGTTrCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGTCT 

GCTTTCGAGTCn'CCCAATGGCTGTCAGTTGTCCTTATCGGTCTCGGTATAATGATCC^ 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGTAAATTATCCTCAATCCTTTTAGGAACCGTTTCAGGTGCAGCTCTTGCC^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGAGCAAG 

TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGATTTTGTTCTGAAAACAAAAGAAC^ 

GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCTTTTCAAGCGACAGAAGCATCAAAAAATC 

AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCTCTTCATGACTCAGAAGAGATTGTGATTGATA 

TAACAGAAGAATAA 

4147.3 

ATGAAAACTAAATTGATCTrTTGGGGCTCTATGCTCTTTCTCCTCrCCCT 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAATCGAGTCTATCTAAAACCAGA 

TACAATTTTCATATCTTGATGAATTATCTGACCAATCCTTTTAGTCAGGTCTTACAGATGCCT 

AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCnTTCATTrGGTTCAGCTAGTA 

CCAAGTTTCTATGTCITTGTCAATAGGATTGTGAAAAAGGACTTrTTCT 

AGTAGTCTTACCTGTGATGATTGGACTTGGGGGAGTTTTGATTGGTTTTGACCAAT^ 

TTCTCTTTGTGGGAGATGATACCTGGCnTTTTGATCCAGCCAAGGAT 

TTTCTTCATGCCTTCCTCCTCTITTrTGCCCrCT 

GTGA 

4149.1 

atgacttatcattttactgaagaatacgatattattgtaattggtgcgggacacgctggggttgaggcttccttgg 

ccgctagccgtatgggctgtaaggtcctgcttgcgaccatcaatattgaaatgctggctttcatgccttgtaatcc 

ctctatcggtggttctgccaaggggattgtcgtgcgtgaagtcgatgccctcggtggcgagatggccaaaaccatt 

gacaagacttacatccagatgaagatgctaaacacagggaaggggcx:agctgtccgtgcccttcgtgcgcaggct 

gacaaggaactttactctaaggagatgcgcaagacggttgaaaaccaagaaaatctgacccitcgtcaaaccatg 

attgatgagattttggtggaagatggcaaggttgtcggtgtgcgtac^gccacccatcaagaatatgct 

gctgttattgtgacgacagggactgctctccgtggggaaattatcatcggagacctcaagtactcatcaggtccta 

accacagcttcgcttctattaacctagctgacaat^ 

aacccctccacgtgtcaaggcttcttctatcaattacgatgtgacagaaattcagccaggagacgaagtgcctaat 
catttctcatacacttcacgtgatgaggattatgtcaaggaccaagtaccatgctggttgacctataccaatggta 
cc agtc atg ag att atcc aaa ac aacctcc accgtgcgcct atgtttac aggtgtg gtc aa g g g agtggg g c ctc 
gttactgtccgtcgattgaagacaagattgtgcgctttc 
agggcgcaatactgaggaagtctatgtgcaaggactttcaaccagtctgcc 

GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 

GTATGATATGGTCTTGCCTCATCAGTTGCGTGCGACTTTGGAAACCAAGAAAATCTCAGGTCTCTTCA 

CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 

CAAGGGAACCATTGAACCCTACCGTCTCTTGACCAGTCGTGCTGAATACCGTCTCATTCTTC GTCAT GACAATGCT 

GATATGCGCTTGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATCAAG 

AAAAATC^TTTGATAATCAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATG 

GTTGAGGAGATGGGCITCAAACCCTTGACCGATGCAGTGACAGCCAAGGAATrCCnTCGCCGTCCAGAAGTTTC^ 

ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATTATCGAATTGATTGAAACAG 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

AGAAACCATCGGCCAAGCCAGCCGTATTTCGGGAGTAAACCCAGCAGATATTTCTATTTTGATGGTGTATCTGGAA 

GGTAAAAATCGTAGTATTTCTAAAACTCTTCAAAAATCAAAATGA 



4149.2 

ATGAAAGTATTAGCTTTTGATACGTCCAGCAAGGCTCTTTCTCTGGCT 
AGACGACGATTAATATTAAGAAAAATCACAGTATTACTCTTATGCCTGCCATCGATTTTT^ 
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TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCTGAAG 

AGCAACTGCTAAGACCTTAGCTCACACCCTGAACATCGAGTTGGTTGGTATGTCGAGTCrCTT GGCT 
CATCAACAAGAAGGTTTGTTTGTCCCCTTGATGGATGCGCGTCGCA^ 

CCAAACCTGTCATGGCAGAAGCGCACCTATCTTTTGAAGAGGTGCTAGAAAAAGTCAAGGGTACTAGT 

CCTTTGTCGGAGAAGTTGGCCCCTTTGTTGAGCAGATTCAAAAACACTTGCCAAGGACTGA 

GCCCAATGCAGCTAATCTAGCTCnTITGGCCTGGGACAAGGAAGCAGACTCCTrGCATGATTTT^ 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCTA 

TGA 

4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTGTTTACC 

TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACTTGGTATGCATTGGCTTA 

TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATCTTTTTGAAGCAGAAGTCCTGCAAATC 

AGGAGCTTATCAGGGTCAGGGGATTGCGTCagCCTTGTTTGCTCAATTGCCGACAGACAAGGAAATTTTCCT 

GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATG ACAAAAC AAGTCTT ATTAGTG G ATG ATG AAG AAC AC ATTCTG AAATTG CTTG ACT ACC ATTT AAGTAAGGAA 

GGCITTTCTACTCAATTGGTGACAAATGGACGGAAGGCCTTAGCTTTGGCAGAAACAGAACCC^ 

TGCTTGATATCATGTTACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAAAACTC 

CAATTATG ATGGTTTCTG CG AAAAGTG ATG AATTTG AT AAGGTTTTGG CCTTGG AATTAGG GGCTG ATGACTACCT 

GACCAAGC Cri ' ll AGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACTAAAGGAGAACAAGA 

AGGAGATGATTCAGATAATATCGCTGACGATTCTrGGCTATTTGGGACCTTGAAAGTATACCCTGAGCGTCATGAA 

GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTTGAAAGCGATAAAAATCCGTTI^ 

TCAAAGTTTCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACriUM'AAAGATGGATITTTATGGGGTGGTGCTGTTGCTGCTCATCAACnTGAAGGTGGATGGCAAGAAG 

GTGGCAAGGGAATTAGTGTTGCTGATGTTATGACTGCTGGTCGTCATGGAGTAGCTCGTGAAATTACTTTGGGA 

TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGATTITTATCACCGTTATAAAGAAGATATAGCACTrr^ 

GCTGAAATGGGATTCAAGTGCTrCCGTACCTCTATTGCATGGACACGTATCTITCCAAAAGGTGATGAGTTAGAGC 

CGAATGAAGAAGGATTACAGTTTrATGATAATCTTTTTGATGAATGCTTAAAGAATGGTA 

TCTATCTCATTTTGAAATGCCTTATCACTTAGTGACCGAATATGGTGGTTGGAAAAATAGG AAAT TGATTGATTTC 

TTTGCTCGTTTTGCAGAAGTCGTATTTAAACGTTACAAAGATAAGGTTAAATATTGGATGACTTTCAATGAAATCA 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 

AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATT 

AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACTTGCAATCCTAAGGATATCTT 

AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTTGCTGATGTGCATGTTTTAGGTAAATATC 

TrCAAGTATTGGGAACGAAAAGGTATTTCAGTTGATTTrACTGCCCAGGATAAAGAAGATTTACTT GGTG 

TAGATTACATTGGTTTO\GTTACTATATGTCCTTTGCTATCGACTCrCATCGTGAAAATAATC 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTrCTGAATGGGAGTGGCAAATTGATCCAGAAGGT 

TTGCGTTATGCGTTAAATTGGTrrACAGACCACTATCACTTACCACTCTTTATTGTTGAAA^ 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGAAATG 
AAAAAGGCTGTAGTTGAAGATGGTGTTGATTTAATGGGTTATACTCCATGGGGATGTATTGATTTGGTTTCAGCTG 
GTACCGGTGAAATGCGGAAACGTTATGGCTTTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATAATC 
GTTCCCCGAAAAAATCTTITGGCTGGTATAAGGAAGTTATTTCATCTAACGGTGAATCAGTAGAATAG 

4154.2 

ATGGATCAACAAAACGGGTTGTTTGGTTTTCITGAAAA 

AAGTAGTACGTGCTATCACGGCTGCAGGTATGGCTGCTGTACCATTTACTATTGTAGGATCAATGTTITTGGTATT 
CAGTATTTTGCCACAAGCTTTCTCATTTTGGCCAATTGT 

CACTTTACATGGTTGCAAACTATGCGACTATGGGTTCTCTATCTCTTTATTTCGTTCTATCACT^ 

ACAAAAATTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTTAATGGTGCCTTGCTTGCC^ 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 

TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGATTTTTACCGCAATCATTATGG 

C AATTGTG ACTGTTCTTATTT ATCGT ATGTGTGTT AAAC AT AATTG G GTTATT AAAATG CCTG AA G CTGTTCC AG AA 

GGAGTTTCTCXjTGGATTrACCGCTTTGGTTCCGGGATTTGTrGTTGCATTTGTTG 

GTAGCAATGGGAACAGATATTTTTAAAGTCATTGCAATTCCATTTGGTTTrGTATCCAATCTGACT 

TTGGTTTAATGATTATTrATCTATTGACrCAACTAC^^ 

TrrGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACTTCGCTGTTGCA 
CTAATATGTTTGT AATTG C AGGTGGTTCTGGTG C AACTTT AGG ACT ATGTTT AT AT ATTG C 1 "IT 1GCCTCTAAATCT 
GAACAGCTTAAAGCAATAGGACG AGCATCTGTAGTTCCAGCCTT ATTT AAT ATT AATGAACGATTAATTTTTGGAT 
TACCrATTATCTATAATCCAGCCTTGGCTATACCATTTATTTTAGCACCAATGGTTACTGCT 

G CG AATTCTCT AAACTTTATTAAGCCAATTATCG C AC AG GTTC C ATGGCC AA CTCC AGT AGG G ATTGG AG CTTTCT 
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TAGGGACAGCAGATCTrCGAGCTGTATTAGTTGCTCTAGTATGTGCATTTGCACCATTCCTAGTCTATCTTC 
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 



5 ATGAAAAAATTTTATGTAAGTCCAATTTTTCCTATTCrAGTAGGATTG 

' TATITTTGTT AATAAT AATCTX5TTG ACGGTTTTAATTTTGT ITL" 1"1 ' I'l ' 1'GTAGG AGG CTATG r l"i r 1 1 1 ATTTAAG AA 
ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC 
AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCTGGAGAGGTTGAG 
TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTCAA^ 

10 TAGGAAATCCGTCTACTTATGCCAAGCITGGTGAGAAGCGTTATGCrGTTCATATGGATGClUC 

GTATTTTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTG ATTGGG ATTGT 
CTCTXjTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCAAT^ 
AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCrCGTCGGGTAAG TATG 
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCrGTTATTGATGCTTTCAGAGAAG 

15 AG ACAGTTGCCCTTG ACCTT AAGT ATGGG ATTTTCTTATGGCGATGG AAATC ATG ATG AG ATAGGG AAAGTTG CTT 

TGCTCAATTTGAACrTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAACGAAAAATC 
CAGTTTATTTTCGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCT 

TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAA TTTA GACATGGATGCTTTGGGCT^ 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTTTATGATGAAGAACAAATGTCT 
20 CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTT^ 
GGATGGTGACCAATCGTTCriTGTTGATTCrrGTAGACCATTCAAAGACAGCCTTAACATTA 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGAT 

TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT 
CGTTTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA 

25 CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC 
GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATC^ 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 
GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTCG 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACnTTAATTTGGCAGCAGCTCAAATTAAAGATG 

30 TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAGAA 

TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGAT^ 
35 TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTCAG GGCT G 
TGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTCGTAAAACCAAGTTAGCT 
CTTTTTTTAGAG(nvU\AGATTTGGCACGTTTGGGCTTGAGTTATCT 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAAATAGTTCGT 
TGATTTCCAGTTTCrTCTTGCTAGCCTTGCTTGCTCCGATT^ 
40 AAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTGTTTGCT^ 
ATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCTA 
AATGTCGATCTTGCTTCACATGATTGTTAAfGGGAT^^ 

CATTAGGAATTTCTGTTTAAATGAAAGAGAAAAATATGTGGAAAGAATTGTTGA ATCG TGCAGGCT 
CTTTTTACnTGCCXn'CCTTTTATATCAGGTTCCCCrAGTGGTTACCTCT 
45 TACAGTCAGGGCTGATAGTTGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCT 

GTTAGCTAGTTTTAATTTTTCTTTTTTTAGAGCTAAAG^ 
GGTCAAATATACrTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAA 

GTCGTGGGATTGTTCCTAAAAAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTAOT^ 
50 TTTATTGCATC AACCAAGTAATTT ACCTTCTTTATTG ATTTATGGAG GTATGTCG AC AGTTCT ATCTTGG ACAG CCT 

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCm 
GTGGTG ATT ATG AGTCGG AC ATT AGG A ATTTCTGTTT AA 



55 ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATGTATCAAGAGATTTTTTCAGGTCTTGGTCAAACAGCAGAGGAAC 
ATTTGTCAAAATCCTTTG AAATTATTG ACG ATAATATGGTGGT AG AAAAGG AT ATC^^ A 
ACACCACnTCTTGCCATTTTATGGTAGAGCGCACATTGCCTACAT^ 

CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTTGAATATCGAAGTGGCCGATGCC 
60 TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAG CTT ATCGTTT AATGGGG CT AT AA 

4157.2 
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ATGAAAGACTTGTTTTTAAAGAGAAAGCAGGCCrTT 
ACCACTTTGTCTTGTTCCnXjCTTGTCCTGTTGGGCTTTCTAGCCT 
GAAAATCATTGGCCTATCCTTTTGTTTGTAGGAATTACGTCTGTTTTACTTTT 
TATGGAGGCTCCAGACAAGCTCTTTCTCITAGTTGGAGAAGA 
5 CATTTCCCTAGTCTTTTGGCTCTTTGTACAG ACCC1' 1TTCTTGCTGTTATTTGCGCC TTTA l r 1 1 i AGCAATGGGTTA 

TGGCTTGCCAGTTTTTCTGCTCTATGTGCTTTTATTGGGGGTAGGAAAATAT^ 
AATTTTTCACTGAAACTGGACTGGACTGGGACTATGTTATTTCTCAAGAAAGCAAG 
TTTCTITGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGTTAAGCGTCGTGCCT 
AGGCTGTTCAGAAGGTGCCTGGGAAGATrrGGCAAAATCTCTATCTGCGTTCTTATCTGCGAAA 
10 TGCTCTCAGTCTTCGTCTTCTCTTGCTTTCCTTQCTGGCGCAGGTTTTTATTO 

TGGTAGTTCTCTTTAACTACCTCTTGCTCTTCCAGTTGCTGGCCCTCTATCATGC 
CAACTCTTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGAGGATTGA 
TTACTTGTGGAATTAGTTGTTGGGTTGATTACCTTCCAAGAAAAACTAGCCCTTCTAGCCTTACTA 
TGGTTTTACTAGTCTTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

15 

4158.1 

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCrATCTTATTCCTTTAGAGACGACTATAAAGTCTGTATTGT 

ATCACAATAGAGATGTTGATrTTTATATTCTCAACAGTGATATAGCTCCTGAATGGTTTAAATTATTGG 

AATGGAAGTTGTGAATTCTACAATTCGCAGTGTACACATTGATAAAGAACTTTTTGAAA 

20 CATATAAATTATGCTTCTTACTTTAGATTTTTTGCGACAGAAGTGi3TTGAATCT 

CGATATCATTGTAACTGGGGAACTAGCTACTTTGTTTGAGATAGATCTCAAAGGATATTCAAT^ 
GATGTCTATGCCTATGAAGGACGAAAATCTGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGA^ 
AACATTCTATTGTCAATAGTTTATTGGAATTAGCGGCCGAGCAGAATCAAGTTGTTCATCTTGGGGATCAGAGTAT 
TTTAAATATTTATTTTGAGGATAATTGGCTAGCCTTAGATAAAACATATAATTATATGCT 

25 ACCTTGCTCAAGAATGTGAACGTCTAGATGACAATCCACCTACAATTGTTCACTATGCTAGTCATGATAAA CCTTG 
G AATACATATAGTATATCTAG ACTACGTG AATTATGGTGGGTTTATAG AG ATITGG ATTGGTCAG AG ATTGC n 
CAACGTTCCGATTTAAATTATTTTGAAAGAAGCAATCAGTCTAAAAAACAAGTGATGCrrGT 
GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTGGCATTTTCATTTGGCTGCACCGTC 
TGAGGAGCTGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTGG 

30 CTATTGGACGATTCTATAGTTTATTTAGATATTAATACAGGTGGAGAGGTTTTTAATGTAGTTACAAGG GC 

AAAGTGGCAAG AAAATCHTCGCTTTTG ATATCACACGTA AAAGTATGG ATG ATGG ACTCTATG ACGGTA 1 1 I i i 1C 
TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 



35 ATG ACTAAG ATTT ATTCGTC AAT AG C AGT AAAAAAAG G ACT ATTTACCTC ATTTCT ACTGTTT ATCTATGT ATTGG 

GAAGTCGTATTATTCTCCCTTITGTTGACCTAAATACTAAAGATrTTrTA 

TCAGCCGCCCTAACAGGTGGGAATCTAAGAAGTTTATCAATTTTITCTGTTGGATTATCCCCr^ 
TG ATTTTATGGC AG AT UrrriC CT AAACGGTTGGGTTTAAC AT CTACGTCTATAGAAATACAAGATCGCCGT 
AAAATGTACCTGACCTTGCTAATTGCTGTGATTCAATCCTTGGCAGTTAGCTrGAGA 
40 ATTCTGCAATATTGGTTGTTCTAATGAATACAATATTGCTGATAGCAGGAACA'IM"IU , ri CI 1 GTTTGGTTGTCAGAT 

TTAAATGCGAGTATGGGGATTGGAGGTTCrATTGTAATCCTCCTATCCAGTATGGTTTTAAATATTCCTC AGGAT G 
TTTTGG AAACATTTCAG ACAGTACACATTCCAACAGGG ATTATTGTGTTACTTG CTTTATTAACCCTTGTl; i i I i CT 
TATTTACTTGCCCTTATGTATCGAGCTCGCTATTTGGTTCCrGTTAATAAAATTGGCTTACA 

CTATTCTTATCTCG AAATCATGTTG AATCCTG C AGGTGGG ATGCCTT ATATGTATGTG ATGAG'IT 1 TCTI AGTGTAC 

45 CAGCHTATITGTTCATCITGTTGGGATTTATTTTCCCrAATCATTCAG 

GTTGG AAAGCCTTrGTGGGTCTATGTTTATATTTCGGTCnTATTITTATTTAG ri " i GTT ACG ATG 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGT ATTT ATCCAGGTGCGGAT ACT 
AGTCGATTTATTAATCGATTGGTCCTTCGTTTCTCAGTCATAGGTGGTCTCTTTAATGTGATTATGGCAGGTGGTCC 
CATGCTTTTTGTTTTGTTTGATGAAAAGTTATTACGATTGGCAATG 

50 TGATTTTTACGATTAGAGACGAGGTCAAGGCTTTAAGGCTAAATGAGACCTATAGACCnTTGATTT 

4158.3 

ATGTCCTCTCTTTCGGATCAAGAATTAGTAGCTAAAACAGTAGAGT^ 
TAGACGATATTrrGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGATAAGCGGATTTTAGG 

55 TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT 
GACAGCTACCATGCCTGTCTATTTGAACGCITTTTCAGGAGAAGGAGTGATGGTTC 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCXjTTTTCTAGGATTGACCATTGGTGTACCATT^ 
ATCCAAAGAAGGAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGAT ATCA TCTACACAACCAATAGTA 
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTTITrACGACCGTTT 

60 GATTATTGATGAAATTGATGATATCnTGCITGATAGTGCACAAACTCCTCTGATTATTGCXj 

AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCTTCKjTCGAAGGAGAGGATTAT^ 

GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATT^ 

GGAAGAGCATCCGTCTTTTGCTCGTCATTTGGTTTATGCGATTCGAGCT^ 

TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 
65 GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC 
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ACCTATCAGAGTCTTTTTAAGATGTTTAATAAGATATCTC 

TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 
TTGACTATCCAGATAATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACAT^ 
TGCTAAGGGAAATCCTTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCTCAACTCT 
5 AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA 
TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGC 
TTGGGGGCTTGATTGTTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCT 
TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCTTAGAGGATGATGTTATCAAGAAATTTGGTCCATCT 
TGGGTGCATAAAAAGTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATAC 

10 CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTAT 
GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTA 
GAGGATGTTGTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 
TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATAGATGTAA 
GCAGTTCGTAGCTTTATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATTACTTAA^ 

15 TATATGAACAGTTTTTACGACTTTCACTGCTTAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGACTATCT 

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 

CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGG 

CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

20 4158.4 

ATGATAGGGACTTTCGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAATTACCCCAAATA 
GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTC TTGCTC AAACTGGTTGACA 
ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCAGTCATTT^ 

AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATCAT 
25 CAATCTGGCTCCATTTGGAATCCTTGGTCTTGTTTITAAAACCATTTCT^ 

TACGGTATTTTATTGGTTCTATTAGTAACG ACTATGCTTTTTGTTG CCCCTGTGGTCAACCCT TTG ATT GCl'i i CI i C 

TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGC 

GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTG 

TATCCCACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCTTTTC 
30 ACTCTTGGAATTCCTGTTGACrTTGCCACAGCCTTTGTCCrCAGTGTGGTAGCAGCTATCTCATCCr 
AGGTATTGCCGGAGGTTCCCTCCTrCTTATCCCAGTTGCTTGTAGCCTTTTCGGT^ 
AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTGAAACAGCCCT 
CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

35 4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTGCCT 

TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCTTGAGCCTATCTGATACGCGTAGAAGTACTTTAAAACT 
CGCAATCGTCITTTTTCTATGCTTCTAGCTCTGGCTATCGGTGTTCTAGCT^ 

GAGTCTCGGCCTCTATCTGGCCTTCTACGTTCCnTTAGCCTACAAGATGGGCTGGGAAATTGGCATCACA CCAA GC 
40 ACTGTTTTGGTTAGCCATCTCTTGGTTCAAGAGTCAACCTCTCCAGACCTTCTAGTCAATGAATTCCTTCT 

TATTGGTAC AGG ATTTGCCTTG CTTGTTAATCTCTATATGCCTTC ACG AG AAG AGG AAATCCAG C ACTACCAC ACG 
CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCAACCGA 
GCACAGCTGGTAGCAGAATTAGACACGCTTTTGAAAGAAGCCCTCAGACTGGTCTATTTGGATCACTCTGACCACC 
TCTTrCACCAGACAGACTACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCGTATCCTGAGAAACATGG 
45 CCCAACAGATTAACACTTGTCACCTTGCCGCCAGTGAAAGCCTGATCTTAGCGCAACTCTTTTGAAAAAT^ 
TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGGAAGTCT^ 
AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCTTCAACTCCTACGTGAAGCCAAAACCTTC 
ATCCAAGTAAAAGTTGATTTTTACCAAAAATATAGACAGTAA 

50 4158.6 

ATGGAAATCATGTCGCTTGCGATTGCTGTTTTTGCCGTCATCATTGGTTTAGTCATTGGATATGTCAGCATCTCAGC 
TAAG ATG AAATC ATCTC AGG AAG CTG C AG AGTTG ATG CTTTT AAATGCTG AAC AAG AAG C AACT AATTT ACGTG G 
ACAAGCTGAGCGTGAAGCGGATTTACTTGTTAATGAAGCCAAACGTGAAAGCAAGTCTCTTAAAAAAGAAGCACT 
ATTGGAGGCCAAAGAAGAAGCCAGAAAATACCGTGAAGAAGTGGACGCTGAATTCAAATCAGAACGTCAAGAAC 

55 TCAAACAAATCGAAAGTCGTTTGACAGAGAGAGCTACTAGCCTTGACCGTAAGGACGACAATTTGACGAGTAAAG 
AACAAACACTTGAACAAAAAGAACAAAGTATTTCTGATAGAGCGAAAAACCTTGATGCGCGTGAA 
AGGAAGTCGAAAGACAAAAAGAAGCAGAACTAGAGCGTATTGGTGCGCTGTCTCAGGCAGAAGCAGGAGATATT 
ATCTTGGCTCAGACAGAGGAAAACTTGACCAGGGAGATTGCCAGTCGCATTCGCGAAGCTGAGCAAGAGGTCAAG 
GAACGTTCTGACAAAATGGCCAAGGACATCTTGGTTCAAGCTATGCAACGTATCGCTGGTGAATATGTAGCGGAG 

60 TCAACAAACTCAACAGTTCATCTGCCAGACGATACTATGAAGGGACGCATTATTGGTCGTGAAGGTCGTAACATT 
CGTACCTTTGAAAGTTTGACAGGGGTCGATGTGATTATCGACGATACACCAGAAGTGGTGACCTTGTCAGGATTTG 
ATCCG ATTCGTCGTG AG ATTG CCCGT ATG ACT ATGG A AATGTTGCTC AA AG ATG GTCGT ATAC ATCCAG CTCGT AT 
CGAAGAGTTGGTTGAGAAAAACCXjTCAAGAGATTGACAATAAGATTCGTGAATACGGTGAGGCTGCTGCCTATGA 
AATTGGTGCGCCAAACCTTCATCCAGACnTGATGAAGATTATGGGACGTTTGCAGTTCCGTACTTCATATGGACAA 

65 AATGTTTTG CG CC ATTCG ATTG AGGTTG CT AAGTTGGCTGGT ATC ATG G CG AGCG AACTTGGTG AAAATGCG G CTC 
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TTGCCCGTCGTGCTGGATTCCTTCACGATATTOGGAAAGCCATTGACCATGAGGTTGAAGGTAGCCACGTTGAAAT 
CGGTATGGAATTGGCCCGTAAGTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAG CCCGT CCAGGTGCTCGTAG 
TGAGTCTCTTGAAAGCTACATCAAGCGTCTCCATGATTTGGAAGAAATTGCTAACGGCTTT 
5 AGCTTTGCCCTTCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGACGACAAAGTCACA 
ATCTTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGGTAACCGTGA^ 
CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 



10 ATGATGTTAAAACCCTCTATTGATACCTTGCTCGACAAGGTTCCTTCAAAATATTCACTCCT 

AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGTTTCAAGTCTGAAAAATCAACT 
TAGAAGAAATCGAATCAGGAAACGTTACAATTCACCCAGATCCAGAAGGAAAACGTGAAGCAGTGCGTCGCCGTA 
TCGAAGAAGAAAAACGCCGCAAAGAAGAAGAAGAAAAGAAAATCAAAGAGCAAATTGCTAAAGAAAAAGAAGA 
TGGTGAAAAAATTTAA 

15 

4161.1 . 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCT TTTAC XjGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCITrrCA 
CCAAATGGTGTGAAGGTTACTATCTTATTGGAAGAATTA 
20 ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATC CAAAT TCCAAGATTCCAGC 
CTTATTGGACCAGTCAGGTACTGAAAACGTAAGAGTCTTTGAGTCTGCTCATATTC^ 
TTTGGAGCCTTTTTACCAAGTAATCCTGTGGAAAAGGTAGAAGTTTTGAATTGGCTATTCT 
CACCTTTTCTAGGTGGGGGATTTGGACATTTCTrCAATTATGCTCCTGAAAAATTC 

ACG ATGG AAGTG AAACG CCAGTTGG ATTTATTGG ATAAGG AATTG GCTCAG AAACC TT ATA TTGCAGGCAATG AC 
25 TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGT^ 

AATTCTTGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGGCAGAAAAAATTGCCAATCGTCCAGCTGTTAAGC 
GTGGCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

30 TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATGTCGGT 

CAGTCGGGAAGAAACGGTCATTGATCCTCTTGGTGCAACTCTAGGAATCATTTCTC 

TATCTCTACAATACTCGCCTCAGTAAGAAATCCAACTCCAATGCGCTGAAGGCAGCTGCTAAGGACAATCTTTCT 
ACGCTGTTACCTCACTTGGAACCGCCATTGCCATCCTAGCTAGTAGTTTCAATTATCCG ATTGT GG ATA^ CTGCT 
TGCTATCATCATCACTTTCnTTATCTTGAAGACTGCCTATGATATCrTCATCG 
35 GCTTTGACGACCGCCTGCTCGAGGACTACCAAAAGGCTATCATGGAAATTCCCAAAATCAGCAAGGT CAAATCG C 
AAAGAGGTCGCACCTACGGTAGCAACATCTACCTGGATATTACACTAGAGATGAATCCTC 

AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
GAACCAGCACCTATCCCTGAGGATGAAATTTTAGACAATGTCTATAAAAAATTGCTTATGCGTC 
ACCAAGGAAACCAACTAGAAGAACTCTTGACTGATGATTTTGTCTATATTCGCCAAGATGGAGAGCAGATGGATA 
40 AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATrACTTCCATCA 

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCGATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT 
CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

45 ATGACAATTAAACTAGTAGCAACGGATATGGACGGAACCTTCCTAGATGGGAATGGACGCTTTGATATGGATCGT 

CTCAAGTCTCTCTTGGTTTCCTACAAGGAAAAAGGGATTTACTT^ 

TAGAAAAATrATTTGCTGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGT^ 
TCAGGACTTGTATGAAGCGACTATGTCTCGTGACTTTTATCTGGCAACTTTTGAAAAG 

GTAGATATCAATAAACTGCTCTTGACGGGTAAGAAGGGTTCATATGTTCTAGATACGGTTGATGAG ACCTA TTTGA 
50 AAGTG AGTCAGCACTATAATGAAAATATCCAAAAAGTAGCGAGTTTGGAAGATATCACAGATGACATTTTCAAAT 

TTACAACCAACTTCACAGAAGAAACGCTGGAAGATGGGGAGGCTTGGGT AAACG AAAACGTTCCTGGTGTTAAGG 
CCATGACAACTGGCTTTGAATCCATTGATATTC^ 

TAAAAAACTTGGTATCACAATGGATCAGGTCATGGCTTTTGGAGACAATCITAATGACTTACATATGATG 
GTGGGACATCCTGTAGCTCCTGAAAATGCACGACCTGAAATTTTAGAATTAGCAAAGACTGTGATTGGTCACCATA 
55 AGG AACGGTCGGTTATAG CTTATATGG AG GGCTTAT AA 

4162.2 

ATGGCAGATATAAAATTGATTGCATTGGACTTGGACGGGACCTTGCTGACTACTGATAAAAGGCTGACGGATCGT 
ACCAAGGAAACCITGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTC 

60 ATGGA lUUCiri CTCCATGAGTTAGGGACTGACGGTCAGGAAGATGAGTATACCATTACrTTTAATGG 

TTC^GAAAAATACAGGAGAAATCCirGATAAAACAGTCTTTTCATATGATGATGTGGCACGTTTGTATGAAGAAA 
AGAGAAATTATCACTGCCTCITGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCT 
TTATGCCAAATTCAATCCAGCITTGACCTTTGTTCC>GTGGACTTTGAAG 
AAATGCGTGACTGCCnTTGCTCAAGAACCCITGGATGCAGCCATTCAGAAGATTrCTCCAG^ 

65 ATGAAATCTTTAAATCACGTGAAATGTTGCTAGAATGGTCACCAAAGAATGTTCATAAAGCAACAGGTTTGGCAA 
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AACTAATCAGCCATCTTGGAATCGACCAAAGTCAAGTGATGGCTTGTGGTGACGAGGCCAATGACCTCTCTATGA 
TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACGC 
CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 



ATGGAAAGTTTACTTATTCTATTATTAATTGCCAj\TCTAGCTGGTCTCTTT 

AGGAGAAACACTTAAGTAAGAGCTTGGAGGATCAGGCAGATCATTTCTCAGACCAGTTGGATTACCGCTTTC 
AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGG^GTCAGCGACCGTTTGCAAGAAGTGCGGA 
TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 
10 ACCAACXjTCTCCAAGCCTTGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGAAAAAC 
TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCTA^ 
CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAAGGTTCTCT 

GATTCTGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 
15 TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAGGTGACAAGGAT 
GAGATTGAACXjCTGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATA 

GCACCACCTCGG ACG ACCAATTTTGG AGTTTTGTTTGTTCCG AC AG AAG GTCTCTACTC AG AAATCGTCCG C AATC 
CGGTCTTCTTTGATG ATTTG AG ACG G G AAGAACAG ATT ATTGTTG C AG G ACC AAGTACCCT ATC AGCCCTTCTTAA 
CTCCCTATCAGTTGGTTTCAAGACCCTTAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTCTTGCCAGTGTC 
20 AAGACCGAGTTTGGCAAGTITGGTGGTATTCTGGTCAAGGCACAAAAACATCTCCAACATGCCTCTGGCAATATTG 
ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCTTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

25 ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTACCTAATCAAATCAGCTGACCTGAGGCAA 
ACTCGAGCTGGGAAAAACTACCTAGCCnTTACCTTCCAAGATGATAGTGGCGAGATTGATGGGAAG CT 
GCCCAACCTCATAACATTGAGGCCTTTACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAAGTTTATAAC 
AATACCCCTCAAGTCAATCAAATTACTCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATTTCAAGG 
TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCTGTCTC 

30 GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA 
CCATGCCTTTGAAACGGGCTTGGCCTATCATACGGCGACCATGGTGCXjTTTGGCAGACGCTATTAGC 
CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCTAAGGTCATCGAGTTGACGGGGC 
CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCnTGGACATATCGCTCTCATTGATAGCGAAATTACCAAGA 
CAGTTATGGAACTCGGCATCGATGATACCAAGGAAGAAGTCGTTrTGCTTCGTCATGTCATCCTCAGTCACCACGG 

35 (nTGCTTGAGTATGGAAGCCCAGTCCGTCCACGCATTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT 
GCAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATAAAGGAGAGATGACCAATAAAATCTTCGCTATG 
GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 

4166.2 

40 ATGAGTGAAAAAGCTAAAAAAGGGTTTAAGATGCCTTCATCTTACACCGTATTATrGATAATCATTG 
CAGTGCTAACITGGTTTATCCCTGCGGGGGCCTTTATAGAAGGTATTTACGAGACTC^ 

GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTrAGGTACTCATCCAGAGGAAGGTTCGCrCATTAAAGAA 
ACGAGCGCAGCGATTGATGTAGCCTTCTTCATCCTTATGGTTGGTGGTTTCCTTGGCATTGTC 
CTCTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTG 
45 TTTGTTTGCCCTCGGTGGTAO\ACTTATGGTATGGGTGAAGAAACAATGG 

ATGATGGCCGTTGGTTTTGATAGCCTGACTGGTGTTGCAATfATTTrGCTCGGTTCTCAAA 

TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTACTTCGT 
CTGATCTTCTGGGTTACCTTGACTGCTCnTAGTACTTGGTTTGTTTACCGTTATGCGGA 
GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACTTTAACGTAGAAGAATCTTCATCT 
50 TCTACACTTAGCAGCAAACAAAAATCAGTTCTCTrClTATTTGTGTTGACATT 
TCCATGGACAGACCTTGGCGTTACCATTTTTGATGACTrTAATACTTC 
TTGTCGGTTCATCTACTTCTGCACTAGGTACTTGGTACrTCCCAGAAGGCGCAATGCTC^ 
CTGATTGGTGTTATTTATGGTCITAAAGAAGATAAGATTATCTCTTCCTTCATGAATGGTG 

TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 
55 AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATCGTTGTAACTTATATCTTCT 
GTCATTCrTGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 
TCCACTTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCTTACCAATCTGCT^ 

TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCTTGCACTTGGACGTATCAACATTGGTACTTGGTGGAAATT 
CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCTTCTrCTCCITGGAA 
60 AA 

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCTTATATCATTAAAAACCTTGATTT CCT 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCT^ 
65 GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG 
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AGCTTCTGGCCATTCTCTGTCATTTGGATGTTGTTCCATCAGGTGATGAAGCAGATTGGCAGACACCGCCATTTGA 
AGCAACTATCAAAGACGGCTGGGTATTCGGACGTGGTGTCCAAGATGATAAAGGCC CTTCG CTCGCAGCTCTCTA 
TGCAGTAAAAAGCTTGCTGGACCAAGGTATTCAGTTCAAAAAGCGCGTACGCTITATC nTTG GTACCGATGAGG^ 
AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCTITGCACC^ 
5 TTTTCCTCTGACCTATGCTGAAAAAGGGCTTCTACAGGTCAAACTTCATGGCCCTGGATCGGATCAACTAGAGC^ 
GAAGTAGGAGGCGCCTTTAACGTTGTACCAGACAAGGCCAACTACCAAGGTCTCCTCTATGAACAGGTTTGTAAC 
GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATGCT 
AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCTTGCTCCTCT 
TCAGTTTTCTTGCAACAC^GCAGGTCAAGACGGCACAGGAAGACAAATCITTGGTC 
10 CTGGTCACCTATCCTTTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC 
TCCIX3TCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAA 
AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAAT 
AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCCACrTTTGCTCG 

GCCnTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGATTTG 
1 5 TACCGTGCTATGGATATTTATGCCGAAGCCGTCTATCGACTTGCAACTTAA 

4169.1 

ATGTCTAATTCATTTGTCAAGTTGTT AGTCTCTCAATTATTTG C AAATTTAGCAG ATA I " rTTCTT 1' AGAGTAACAAT 
CATTGCTAACATATACATTATTTCAAAATCAGTAATTGCCACATCACTAGTTCCTATCTTAATAGGAATATC 

20 TTGTTGCGAGTCTTTTAGTTCCGTTGGTTACTAAAAGGTTAGC^ 

AAGACTATATTATTGGCGATACTGGTAGGAATGTTTACCGTAATGCAATCCGTAGCGCCTTTGGTGACCTATCTAT 
TTGTTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACCCGTTTCCTATGCTATTGTGCCACGCTATGCGACC 
TTGGGTAAGGCTAATTCAGCCTTATCAATGACTGGTGAAGCTGTTCAATTGATAGGTTGGGGA TTAG GTG^ 
TGTTTGCAACAATTGGTCTGTTACCTACCACGTGTATCAATTTAGTCTTGTATATCATTTCT 

25 TTTCTTCCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAAACTAATCTTGAAATTT^ 

TTGCTAGAAATCCTAGATTAAGACTTTTTGTATCAGCAAATTTATTGGAAATTTTTTC 

TCCATTATACTTGTTTTTGTAACGGAGTTATTAAATAAAACGGAAAGTTACTGG 

CTATTGGTATTATAATTAGTGGCTTAATTGCTTrTAGGCTATCTGAAAAGTTCCT 

ATTATTC ACCCC AAATCTAAAAACC ATCC AG A ATCCTTG CCTTAGCTT AG ATCCTGG ATGG'ITTC IT IT 1 ' I C ACCC A 
30 ATGGGTGTTTTTTACTAGACAAAAAAGAGTTTCCCCTTTATGGTATAACT 

AAACTCACATGAACAGTTTACCAAATCATCACTTCCAAAACAAGTCTTTrTACCAACT 

TTTAACCCAGTATGGTGGTCTTATCn'lM'lUnCAGGAACMllUU'l'CCCAGTTGAAACTAAAAGAGCGG 

ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGG 

TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 
35 TCCAAAATTGTTGGAAGGAGGGCAGCTTGCTTCACAGCCAACCTTATCCCGrrri CI 1 1 CCAGAACTGACGAGGAA 

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATTCTTTTTACAGTTT 
TTGTAGATATCGATTCTACCCATTTCACAACITATGGCAAGCAAGAAGGTGTTGCTTATAA 
TCATGGCTATCATCCTCnTTATGCITTCGAGGGGAAGACAGGTTATTGTTTCAATGCCC^ 
GTTATTGTTCTGAAGAGGCAGACAGCTTTATCACACCTGTTTTAGAACGGTTTAATCAACT^ 
40 TAGTGGCTTTGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACTACCTCATA 
AAATACTGTTCTGAGCCGTCTTGGAGACCTTTCCCTCCCTTGCCCACAGGATGAGGACT^ 

TCCGCCTACTCAG AAACTCTCTATCAAG CAGG ATCTTGGTCG C ACAAGCGTCGTGTCTGCC AGTTCTCTG AACG AA 
AAGAAGGAAACTTGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGAGTGGAACAAGCCAAGACCAGTTTCA 
GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGATTTTTTGG CGATAA A^ 
45 TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAGCTGTATCGCCTAC^TCTCT 
CATCTAGCTGGAGGTGACTTCCAAACTTTAACAATCAAACGCTTC 

GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTATTCCGAATTGTTTTC^ 

TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCTTATGAACCACCTAGAAGAAAAGCGT 

CATTAA 

50 

4169.3 

ATGATGGAGTTTTTTCAACAGCTTCCTCATTTAGAGCCATATGGCAATCCTCAGTATTTTGTTTATGTGA 
AACCTTG CCC ATCTTT AT AGGTCTCTTTTTC AAG AAACG CTTTG C CTGGT ATG AAGTGTTGGT AAGTCTCTTCTTTA 
TTGTGACCATGTTGGTGGGTCGAAAGACCAATCAACTAGCT^ 
55 C CTGCTTTTCT ACAAG C ATTATCG AAAAAG C AAG G ATGG C AAGTG GGTCTTCT ACTT AGTT AGTTTTCTGTCCCT A 

CTTCCG ATT ATCTTTGTCAAGGTGCAACCAGCTATCAATGG AACG CAGTCTTTGCTT GG GTT CTTG GG AA r i ' 1 1; 1 1 A 
CCTT5ACCTTTCGTTCGGTTGGAATTGTCATCGAGCTGAGAGAT^ 

CTCCGTTTCCnTCTCTrCATGCCAACTTTCTCGAGTGGTCCAATCGATCGCTTTAAGCGATTTAA 
GGCTATTCCTGAGCGAGATGAGTTGATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCrrrriGTAT 
60 AAGTTTATCCTAGCTCATGTTTTAGGAGAGACCTTACT^ 

TCTTT AATCTCT ATGCCTTGG C AGTT ATGT AT ACTTTTG GTCTGG AACTCTTCTTTG A CTTTG CAGGTT ATTCT ATGT 

TTGCTTTGGCCATCTCAAACrTGATGGGAATCCGTAGCCCTATCAACTTTAACAAGCCCTTTTTAT^ 

AAAGGAGTITTGGAATCGCTGGCATATGAGTCrGTCCITCTGGTrCCGTGACT^ 

GTGTT AACC AG AAAG AAAGTCTTT AAAAATCGT AATGT AACCTC AAG C ATGGCCT AC ATTGT AA AT ATG CTG ATT A 
65 TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCnTTCATGGACTAGGCrTGGTCATCAA 
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TGCCTGGGTTCGCAAGAAAAAAACGCTCAATAAGGAACGGAAAAAAGCAGGGAAGGCTGCCCTACCTGAGAATC 

GCTGGATTCAGTTGCTTGGCATGGTTGTCACTTTCCATGTTGTCATGTTGTCATTCTTAATCn^ 

AT AATCT ATG GTTT AAAAAAT AA 

4169.4 

ATGCTTAAACGCnTATGGATGATCnTCGGACCGGTCTTGATC 

CCTACTGAGATGCATCAT AATCT AGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAAGGAGC 

GAAGTOWVAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCIUC 

TTTTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCCT 

GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACAGCTGGAGAATAAACAAGTTGTGTAT 

GTTATCTC^CCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATTT^ 

TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCnTA 

AAACGTAGCTATGAAGGACCTGGTTCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT 

TGAATTATTGGCTCGTTITAATGAACGCCAAGCTTCCrrrriTGGTCA^ 

ATAAGCATGTAGCTAAGTATTTAAAAATCTTGCCAGACCAGlU'rrClU'ATCAGGCAATAGAAGATGTTGTCAAAGC 

AGATGCTGAAAAAAATACITCCAATAATGAGATGGGAATGGAAAATTATTrCTATAATGAGC^ 

TTTGAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATG 

TT AAC ACAGTTTTCTAAATCT AAGGT AAACCCG A' i ' IT 1 1 ATCATTCCACCTGTTAATAAAAAATGGAT G AAC TATG 

CTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGAAAGTCAAGGTTTTACCA 

ATATAGCAGA UUUU ' l 'CTAAGGACGGCGGGGAGCC rri ' Crr iATGAAGGACACCATTCACCTTGGTrGGTrGGGTTG 

GTTGGCTTTTGACAAGGCAGTTGATCCTTTCCTATCCAATCCCACACCAGCTCCGACTTACCAT 

TTTTTCAGCAAAGATTGGGCGACnTATGATGGAGATGTCAAAGAATTTCAATAG 

4169.6 

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAAAAGGCAGA 
GGATACCAAGACAGTCCAAAATGGTTACTTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCAGTTTATCCTTTCCTTGAAGACGGCACGTTTGACCAAGTCrTTG 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACATT 

ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACTTACAAGTATGTCGGTAAGAAAATT 

TTGACTTACAAGAAAGGCAATCGTGGCGTGCGTTTCCTCTTTGAAGCCACAGATGCTGACGCTGGACAA 

ATGTTC AGTTTAGTGACCACAATGTTG CCCCAGTT AAGGCAG AAC ATTTCCATATCTTCTTTG G AGGCAC AAGCCA 

AGAAGCCCTCTrrGAAGAAATGGACAACrGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAAATCGCCCA 

AGAAATGTTGGCGCATTGA 

4170.3 

ATG AAAG ATGGTCATTTGCTAGCCCATCATATTCGTTTGTTG AATGGGCGG A I'fi" 1"! CAAAAGTTACTG AGTC AAG 

ATCCTGAGGCTCnTTATAGGGGTGAACAGGGCAAGATTITAGCGGTTTTATGGAATAGTGAAACTGGCrG 

CrGCGACAGATATCGCGCTTGCGACTGGACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAAC 

AAAAGCTTGTAATTGTT AGTCCGTGTG G AAAAG AC AAG CGTAAG AAGTATTT AGTTTTAACGG AGTTAGGC AAGT 

CCCAGAAAGAAGTGGGGCATCGTGTCAGTCAGAAATTGGATACTATCTTTTACAAAGGATTTTCAGAGGAAGAAA 

TTGACCAATTTGAAGGTTTTCAAGAAAGAATTITGGCGAATCTGAAAGAGA^ 

4170.4 

ATGACTAATTTAATrGCAACITTTCAGGATCGTTTrAGTGATTGGTTGACAGCTCT 

GLTlTl<jACCTTGTTACTAGCTATTTTGCTTGCGATTCCCTTGG 

ACTGGGTCnTGCAGATTGGAGGTATTTTCCAGACCATCCCGTCTCT^ 

GGAATTGGGACCTrGCCGGCTTTGACAGCTCTAGTGATTTATGCGATTTTCCCTATTTTGCA 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCTTTGGGATGACCAGATGGGAACGTCTCAAGA 

AATITGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGATTATCGGTACGGC 

AACCTTGGCGGCCTTGATTGGTGCAGGGGGACTAGGTTCCOT 

TTGATTTTGATTGGGGCACTITCTTCr^AGTGCTAGCCATTC 

CAAAATTACGGACGATTTTCTCAGGTTTTGCCTTGGTGGCTTTATTACnXjGGTCrc 

GTTCAAAAAGAGAAGGAAAACrrGGTTATTGCrGGGAAAATAGGTCCAGAACCAGAAATTTTGGCCAATATGTAT 

AAGTTGCTGATrGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATTTTGGGAAGACA^ 

AAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTC 

ACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 
CTATCTCAAACCGATGTCTTATCAAAACACCTATGCTGTAGCTGTTCCGAAA 

AAGACCATTTCAGACTTGAAAAAAGTGGAAGGGC^GTTGAAGGCAGGTTTTACACrCGAGTTTAACGACCGTGAA 
GATGGAAATAAGGGCnTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATC 
AGGCTATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA 
GGTCTTGGAAGATGACAAGC\ACrrcrrCCCACCrTATCAAGGGGCTCCACrCATGA^ 

ACACCCAGAGTTGGAAAGAGTTCITAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 
CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA 
ATGA 
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ATGATGCATACTTATTTGCAAAAGAAAATTGAAAATATCAAAACAACCCTAGGTGAAATGTCAG GTGG TTACCGT 
CGTATGGTTGCGGCTATGGCTGATTTAGGATTTTCAGGAACTATGAAGGCTATCTGGGATC 
GTAGTTTTGCCCAGTGGATTTATTTGCTGGTTTTAGGAAGTTTTCCTCTCTGGCTC 
5 ATTGTTGACTGGATTGGGATGATTTGTAGCnTGACAGGGATTATCTGTGTAATCTTTGTA 
GTAATTATCTTTTTGGCITGATTAACTCTGTTATTTACCT^ 

CTGACGACACTTTACTTCACAGTCATGCAGCCAATTGGACTTCTAGTTTGGATTTATC^ 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCTTTCCATTAGT^^ 
GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCTATCG 
1 0 TGGGGTAGGGCAAATCCTC ATG ACAG CTGTTTACCGTG AAC AGTGG ATATTCTGGGCGG CTACCAATGTCTTTTCA 

ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCTAATTTATCTCAT^ 
GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 



1 5 ATGAGAAATATGAAGGCAAAATATGCTGTTTGGGTGGCTTTTTT 
TGCAGGTGGAGTATTTGGTTCTAGCXjCTGTTCTTGCTGACTCTC 

ATATCAGCnTTTCTAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACAC CTTGG GCTATAAGCGGTTTAGCC 
TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGGCTCTGTTCTAGTCATTTTGGAAAATGTCACGAAGAT 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACTATCAATCTGTT 
20 AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGC^TTTTCTGGA AGATA CGCTAGGG 
TGGGTAGCTGTTATCCTGATGGCGATTGTTCTTCGATTTACGGACTOGTATATCC TAGA TCCT 
CA TnvmX JnTATTCTTTCAAAAGCCCTTra 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCnTAATC 
TCTGGACTATGGATGCnTTGGAAAAAAATGCCATTGTCCATGTTTGTCTAAAAGAAATCK} 
25 TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGTTITCAAAATATTACCATTGAAAT^ 

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGATTGAG 
30 GATGGGGAATTTATGGTTTTAGTAGGGCCTTCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCGT^ 

TGG AACC AACTG ATGG AAAT ATTTAT ATGG ATG GG AAGCG C ATCAAAG ACTATG ATG AG CGTG AACTTCGTCTTT 
CTACTGGTTATGTTTTACAGGCTATTGCTCTTTTTCCAAATCTAACAg 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCTTTTGGCTAAGGTTGGTTTACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 

35 GTCAGCCCAAGATTTTCCTCATGGATGAACCCTTITC 

GACAAAAGAATTGCATAAAGAGTTTGGGATGACAACGATTTTTGTAACCCATGATACGGATG AAGC CTTGAAGTT 
GGCGGACCGTATTGCTGTCITGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

40 4171.1 I 

ATGTCAGCAGTTGCTATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCTTCTAGTATTCATGGTC 
ATGGTCGTCAAGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTCAAC 
ATATCTTTTTCACTTCTGGTGGGACTGAAGGCAATAATACTACCATCATTGGCT 
AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCTTGAAACAATTGATTACTTG GT^ 
45 GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCTTTACGTGAC 
GATACGArrTTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCTACTGCCCATCG 

TACTCAAGCAACACCCTGCTGCCTATCATGTTGATGCAGTTCAGGCTATTGGTAAAATC CCAAT TCATTCAGAAGA 

ATTGGGCATTGATTTTCTCACTGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATCGGTTTTCTCT 

GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCTGCCTG 

50 CCATTGTAGGCATGGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAACATTTTCAACATGTACAAAATC 
TAGAAACTGCCnTTCTGGCAGAGCTGGAGGGCATTCAGTATTACCTGAATAGAGGAAAACATCATCTCCCTTATGT 
TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTTACTCCTTCGGCTAGATTTAGCTGGAA 
CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATGTTCTTGAAGCCATGTATGGCGCAAATTCAGAACG 
(nTGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGTTGAAGACCTACAAACCCTCGCAAA^ 

55 AAAGAAATTATCGGAGGTTAG 



60 4172.1 

ATGTTATTCAAATTATCTAAGGAAAAAATAGAGCTAGGOT^ 
GTTTTCCCTTGGTCATTTTACTAGGCTCTCTTCTTT^ 

TATITTGATCATCTTTTCACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCCTTCCAGT 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTGATTCAGATCGGTGGTCTAGGGCT 
65 TCTATATCCAGAGCAAGCAAAAGCTTAGTCTTCGTAGCCGTGCAACTATTCAGGATAGTTTTAGTTATGGAGAAAC 
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TCGATCTTTGAGAAAGTTTGTCTATTCTATTTTTCTCACGACCTTTTTG 
GTTTTCGCCTTATTCCTCAACTTGGCTGGGGACGTGGTCTTTTT 

AATGCCGGTTTTGATAATTTAGGGAGCACCAGTTTATTTGCTTTTCAGACCGATTTACTGGTCAA 

AGGCTTGATTATTACAGGCGGCCTTGGTTTTATGGTCTGGTT^ 
5 GGACGTCTGCACTTTCATACGAAGCTTGTACTATTATTGACTATAGGTTTGTTGTTATTTGGAA 

CTTTCnTGAGTGGAACAATGCTGGAACGATTGGCAATCTCC 

ACAGTGACGATGCGAACAGCTGGCTTTTCTACGATAGATTATACTCAGGCTCATCCTGTGA CT 
TCTTACAGATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGAT^ 

GGTCTTTGCACGAAGTGAGCTTCTAGGCTTGCCTCATGCCAATGTTGCGAGACGAACGATCGCGCCGCGAACGGTT 

10 CAAAAATCCTTTAGTGTCTTTATTATCTITTTGATGAG 

AGGCAATCCTCCCTTTATCCACCTCGTATTTGAAACCATTrCAGCTCTTAGTACAGTTC 
ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTTATCATGCCACTTATGTTTATGGG 

ACGAATTGGTCCCTTGACCTTGTTTGTTAGCnTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCAGATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCTTGGGAATTTTTGGGA 

ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCTGCGATACCGTTGTAGTCGCGACAGG 

20 TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAG 

AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCTCTAGCACAGACCATTCITTTCCATAATAGTGTTGATGTCTTTCAGTTGGATAAAAATGTGT 
AGATGAAAATTCCTCAGTCnTGGGCAGGTCAAAGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTCTTGAAAGCAGATAC 

25 CTATATTTTGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTAGCATTGAATTCGTAA 



ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTG 

TTGGTGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTT 
30 AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG 
CnTGGTAGAGGCITCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTC G 
ATTCTTGAAACCTTGCAGGAACTTGAGAGCAAAGGTCAAGAGGTGG 

AGGAAGGGCAGTCTCGTAAGAAGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCOT 
35 CAGTTCTAA 



ATGAAATTCAATCCAAATCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTGCCTCAGTTGTTG 
TGGCTAGTGGCTTCTTTGTCCTAGTTGGTCAGCCAAGTTCTGTACGTGCCGATGGGCTCAATCCAACCCCAGGTCA 
40 AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACnTATCAGAAAAACCAGGAGACACCGTTCTCACTCA 
AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACITCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 
AACAAGCCCTTCTAGTCTGGATACACTTTTTGAAAAAGATGAAGAAGCTCAAAAAA^ 

CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGT 
AAAAGGTGGAGTGAAAGAAAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCT 
45 AGGTCCTTTCACTG C CGGTGTAAACC AAGTAATTCCTTATG AACTATTCGCTGGTG AT GGT A TGTTAACTCGTCT A 

TTACTAAAAGCTTCGGATAATGCTCCTTGGT^ 

GATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCAA^ 
TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTT 

TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAACAGTTCAA 
50 AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAj^\AAGCCAAGGGTGAAGGT 
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGC 
TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 
ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 
TCATTGACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTA 
55 AACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCT 

TTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTG 
AAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATAGGAACTCTTCGCAGGTGATGGTATGTTGACTCGTCT 
CTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 
TGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAATTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACA 
60 AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA 
CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAAT^ 
AAC AGTTC AAAAAGCCGTTGC AG AC A ACGTT AAG AC AGT ATCG ATGTTCC AG C AG C CT A CCTAG 
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ATG AAACTAAAAAGTT ATATTTTGGTTGG ATATATTATTTCAACCCTCTT AACCATTTTG GTTGTTTTTTG GGCTGT 
TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACTTTTTGCTTGGGATGACCATCGTT 

GGGATTAGTCTCTTTCTCCTATTGCCAGTCTITACGTCGTTGGGCAAACTCAAGGAGCATGCC AAGCG GGTAGCGG 
CCAAGGATTITCCTTCAAATTTGGAGGTTCAAGGTCCTGTAGAATTTCAGCAATT^ 
5 GTCCCATGATTTGCAGGTAAGCHTTGATTCCTTGGAAGAAAGCGAACGAGAAA AGGG Cr^ 

GTCGCATGATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGATTATCAAGGA 
GTCGGAGCAAGCTCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAAT AAACTG GTTGAGGAGTTGAA 
TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTATTTTTCT 
ATTGAGTGCATGAGTGAATTTCAGTTITTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCA GGTAAT CCCAGAGT 
1 0 CTG CCCGG ATTG AGGG AG ATT ATG CT AAGCTTT CTCGT ATCTTG GTG AATCTGGTCG AT AACG CTTTT AAAT ATTC 

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTAAGCTGGAGAAGGACCAGCTTTCAATCAGTGTGACCGATGAAGG 
GCAGGGTATTGCCCCAGAGGATITGGAAAATATTITCAAACGCCTTTATCGTGTCGAAACTTCGCGTAACATGAAG 
ACAGGTGGTCATGGATTAGGACTTGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 
AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCGTTCTCAACCTCTCTGGTAGTGAAAATAAAGCCTAA 

15 

4172.6 

ATGTTTGGTCAAACGGCTCAACATGGTCnTACGAATAGCCTGAAAGACTTCTGG 

CACAATrGGCG rrilU ' ri GCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAAATCACCGTCGTGA 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATC 
20 TCGATTGACCTTGAGTTGCTGGTCAATGCACTTAATCATCACTTGCTCATTGACAGACTGGTCCTCA 
AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCnTGACCTGCTGAAGG 

TCGACATAAAAAGTATGTTCAGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTGC 
ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCrGGTCCCAGCCAAAAATCTGCCGAACAGGCAAGACTGA 

25 4174.1 

ATGGAACATTTAGCAACTTArTTTTCAA 

GATTAAGCGGTATGGGGTCnXjCTTATGGAGTTGGTAAGGCrGGGCAATCTC 
CTGAAAAGTTTGCCTCAGCTTTGATATTGCAATTATrGCCCGGAACACAAGGATTATATGGTTT^ 
TITAATTTGGTTGCAATTAACTCCAGAACTTCCriTAGAAAAAGGCGTTGC^ 
30 CTATTGTAGGATACTnrTTCAGCTAAGCATCAAGGAAATGTAGCAGTAGCGGGAATGCAAA TCTTG GCTAAAAG 
CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATGGTAGAAACCTATGCAATTCTTGC^ 
CATTTTGACCCTTCGTGTATAA 
4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCG AAACC 
35 AATGTTTTGGCTAATCITTCCAACGCCAGTGCTCTCATAAAATCACGTTTTCCTAATACCGTATTTGCA 

TTTGTTCGATGGAAAGGAATTGGTTTTAGGCCCCTTCCAAGGAGGTGTTrCCTGCATCCGTATTGCACTAGGCAAG 
GGTGTTTGTGGTGAGGCAGCTCACTTrCAGGAAACTGTTATTGTTGGAGATGTGACGACCT 
CTTGTGATAGTCTAGCTAAAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGGAGTTCT 
GGATTCTTCAGAGATrGAGGATTACGATGCTATGGATCGAGArrATTTGGAACAATTTGTCGCTATm 
40 AAG AC AG C ATGG G ACTTT ACG ATGTTTG AGG AA AAATCTT AA 

4175-3 

ATGTCAGTATTAGAGATCAAAGATCTTCACGTTGAGATTGAAGGAAAAGAAATTTTA AAAG GGGTTAACCTGACC 
CTGAAAACAGGAGAAATrGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 

45 GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGrrrrGrTTGATGGCGTAAACATCC^ 

GAGCGTGCGCGTATGGGACTTTTCCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAAT^ 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATITCAGTrCGTGAGTrTATTACTAAGCTAGATGA 
AAAAATGGAATTGCTCAACATGAAAGAAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCTGGTGGTGAGA^ 
AAA ACG C AATG AA ATTCTTCAACTTTTG ATGTTGG AG CC AACATTTG CTCTTTTGG A CG AG ATTG ACTC AG GTCTT 

50 GATATTGACGCTCTTAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTITTGGTGCr 
CTCACTACCAACGTCTTTTGAACTATATCACACCTGATC^ 
TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACGTGAAGGATACGCAAAATTAG 
CAAGGAAGAATTGTAA 

55 4174.4 

ATGCCCTACAAAAGACAAAGGAGTTTTTCAATGGCACTTTCTAAACTAGATAGC 

ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCTCTGTCAAGTTrGATGCAGAGGACCGTTTGGAAGATATTGCTTTTCTA AATTC AGGATG 
CACGATTTCAACTGCTTCTGCTAGTATGATGACAGATGCCGTTTTAGGAAAAACCAAACAAGAAAT^ 
60 GCGACTATTTTTTCTGAAATGGTTCAAGGGCAAAAAGATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCIT 
TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCTAGCTTGG 
AATCAAGAAAAACAGTAA 
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ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCT 
GACGAGATGATTAAAAATTTGACAGACCACGGTTATGTAACAGATTTTGAAACATTTAAAGA^ 
CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCTGCTGTCAA^ 
GCGACAGTTCTATTTGCTAAGTCAAATAAGGGTGTTGACTACGAGAGCTTGGATGGACAAGCAACTGACCT 
5 TCATGATTCCAGCTCCAGAAGGTOCCAATGATACTCACTTG 

AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGACCAAGTTATCGAACITTTT 
AAAACTGAGGAACTTGTTCAAGCACCTGCTAATGACTCTGGTGACTTTATCGTAGCT^ 

GTATTGCCCACACnTACATGGCCCAAGAAGCCCITCAAAAAGTAGCTGCTGAAATGGGGGTTGGTATCAAGGTCG 
AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCATTG 
10 CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC 
GTAAGACAGAAGAGCTAATTAACTTGGCTCITTCAGGAGATACTGAAGTCT 
CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCC^ 

TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCTTCTTGATTGACGGTGCnTTG 

GAAAACCTTGGCAATCTTGGTTCTTACCATGAGTTAGCTTCTATGTTCATGAAAATTGGT 
1 5 G ATG CTTCCAGTCTTTG CGG GTT ATGTTGCCT ACTCT ATTGCTG AAAAACCGGGTTTGGT AG C AG GTTTCGTGGCT 

GGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCrTATGCCGCAGGT^ 

CAGGTGTCTCATCTGGTTTCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCTTG 

TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCT 

GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACT 
20 AGGAGGTTCAGCTGTCCTTCITGGTATCGTCCTTGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAA 

GC^GCTTATGTCITTGGTACAGGTACGCTTGCAGCAACrGTTTCTTC^ 

TGGCTGGAGGAATGGTGCCACGACTTGCAATCTTTGTCGCAACTCIT^ 

ACGTAACTCTGGTTTGACAAACATCATCATGGGCnTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTGCCGCT 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCITGGTTCAGCAGTAG 
25 AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCITACnTCAAATGCTCTC 

GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175.6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 
30 ACGAATGTTGATTTCGTTAGGAATTGCGATTTTATTGATTTTCGCAGCCrrCAAAT^ 
CTTTATAATTTAATTCGCTTGCTAGTGGGTAGC 

TTTCAAGTGG ATACG AAAACAGGAAGG ACTCTTATCTGGCTTTTTC ITG 
AGGCCTACTTGGTTTGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGAC CATG GCTCAGGTTGTG ACAGA TCT 
GACTGGTTTTCGAACGACTAGCTTTGCrGGAGGGGGCTTGATCGGGGTCGCTCTTTATAT^ 

35 TTTTCAAATATCGGAACTTACnTTATTGGTTCTATCTTGATTTTAGTGGGTTCTCT 

TACGATATTGCTGAATTTTTCAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAAC^ 
TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAA 
AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAA^ 
CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGAA 

40 GATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACIT 

CAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAAAATCTTA 
GAAGCAACCITTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTA^ 
GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATCAGATGACCTCGCTCTAGCCTTGGCTG 
CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAAC TCCGA TATTG 

45 CCACTGTATCTTTCCGAGAACTATGGGAACAATCGCAAACGAAAGCAGAAAATTTCTTGGAAAT^^ 
AGGCTGTTAATGGAACCGCAAGAGCTTTTGACCTTTCTAAAATGCCCCACTTC 

AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCC AGTCG TGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCAAGGTG 

50 GGAGTTCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCCG 
CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCGTCCATCTGTTGATGT 
CATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACTCCCGTACG 
ATTTTGGATGAAAATCGAGCAGAAAAACTTCITGGTCGAGGAGACATGCT^ 

55 CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAA CTTCA TCAAGACTCAGGCAG 
ATGCAGACTACGATGAGAGTTTTGATCCAGGTGAGGTTTCTGAAAATGAAGGAG 
GTGGTGATCCGCTTTTTGAAGAAGCTAAGTCTTTGGTTATCGA^ 

TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
GAAGGTACCAAACCTCGAAAAGTGTTACAACAATAA 

60 

4176.1 

ATGAGTTATTTTAAAAAATATAAATTCGATAAATCCCAGTTCAAACTTGGTATGCGAACCTTTAAA 
CTGTTTTTCTAGTTCTCTTGA rri 1 - l GG Cl ' riUU ' r GGCTGGAAAGGTCTTCAAATTGGTGCTTTGACAGCCGri'rrTA 
GCCTGAGGGAGAGTTTTGATGAGAGTGTTCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
65 TGCCTTGGTCTTCTTCTTATTAAATACCTTTTTC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



58 



PCT/GB99/02451 



CCATGTTAACCATTATGACAAATGTAGCCATGAATAACAAAGCAGGGGTTATTGGTGGTGTAGCAGCTATGTTAAT 
CATTACCCTATCAATTCCAAGTGGTGAGACAATTTTGTAC 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGTCTCTTTTTAGAGAAAAAAGAAAAATAA 
4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCGCTCTTATCACAAAAAACAAGATTCATACACAGGCTGAG 

TTGCAAGCCCTTCnTGCTGAGAACGACATTCAAGTAACCCAGGCAACCCTCTCACGCGACATCAAAAATATGAAC 

CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCTTAACAATGGTTCCATCTCAAAATGGGAAAAACGTC 

TCGAACTCTACATGGAAGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAAGTCCT^ 

ACTGGCTCAATCCnTTGGTTCTATCATTGATACTTTGAGCTTCCCTGACGCTATCGCTACCCTTTCT 

TCTGTCTTATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAAGAACTGAAAAAATTCGCCCCACCATT 

TTTCTTTGAAGAATAA 

4179.1 

ATGAAAAGTATAAAATTAAATGCTCTATCHTACATGG 

AACCTATGTCGCGCGTGTCITGGACCGAACTGACTATGGTTACTTCAACTCAGTCGAC^CT 
TGCCCTTTGCAACTTATGGTGTCTATAACTACGGTTTAAGGGCTATCAGTAATGTCAAGGATAAC 
TAACAGAACCTTTTCTAGTCTTTTTTATTTGTGCATCGCTTGTACGA 
ATCCTCTCTTCTTTACTGATAATCCAATCGTCAAAAAGGTCTA 

TTTTCAATCG AATGGGTC AATG AAdCTCTGG AAAATTACAGTTTTCT C r 1 ' V 1 ACAAAACTGCCTTC ATCCGTATCCT 
G ATGCTGGTCTCTA 1" 1 TTCT 1' ATTTGTTAAAAATG AACACG ATATTGTTGTCTATACACTTGTG ATG AGTTTATCG A 
CGCTGATTAACTACCTGATTAGTTATTTTTGGATTAAAAGAGACATCAAACTTGT^ 
TAAACCACTCTTTCTCCCrCTGAC^GCCATGTTAGTCTTTGCCAATGCCAATATGCT 

TCnTCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACTATACCATAGCTCAGCGAATTGTGACCGTTATAGC 
TGGGGTTGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 
CTATGTTTCTCTGGTTAATAGAGGTAGTCGAATCTTTAACITCrr^ 

TAGGACCAAATGCCATCCTACTTTAOTGTAGTGAAAAATATATCGGAGGCGGCATCTTGACCT 
TCGTACGATTATCCTGGCCTTAGATACCATTCTTGGTTCCCAAATTCTCTTTACCAATGGCTATGAAAAACGTATC 
ACAGTCTAT ACAGTCTTTGCTGGG CTACTCAATTTGGGCTTG AAT AGTCTC C ITl ' l ' lT 1 'C AACC AT ATCGTGGCTCC 
TG AAT ACT ACTTACTG AC AACT ATGCT ATC AG AG ACTTCTCTACTTGTTTTCT AT ATC ATTTTC ATCC ATAG AAAAC 
AACTCATCCACnTGGGACATATCITrAGCrATACTGTTCGATACTCTCr Cl ^ 
TCCTGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCA rri ' l 1 GATTA 
ATACTGGTTTGATTGTCTTGCTATCAGCTATCTCTTATATTAGTCTACrTGTCTTCA 
G AATTTTT AAACC ATGTCCT AG CCTTAAAAAAT AAATTTAAAAAATC ATAG 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACT 
AGCACAATATCAACTAT ATT ATT AACT ACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTrATCCCTTGGGA 
CGACGATATTGATCTGTCCATGCCTAGAGAAGACTACCAACGATTTATTAACATTT 

GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAACTTTATCAAGATAACCGACAGTACGACT 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCnTATCGATATTTTCCCTATAGAT^ 

TCCTAAGGTCATTGATACITGTTATAAACTGGAAAGCTTCAAACTGCTGTCTTTCAGTA 

TATAAGGATAGCCnTTTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTTCGACCGGTTTCT 

TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGGCTTTTATCCCTTCAAA 

ATTTAAGGAAAAGGAAGTCTTCCCAAGTGGTACCTTTGATAAAACAATCGA 

CCTGCACCTGAAAAATTTGATACTATTTTGACACAATTITATGGAGATTATATGACCCTACCACCA 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATTTACGAGATCTTGTATCTGACCTAACCATGACC 
ATCCAAGACGGGGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCITACITAAAATTTTAATG 
GGGGAAGCTTTGTCTGATTTCACTATCAAGGGAAACATCCAATCTGACTATCAGTCACTGGCCTACATTCCTCAAA 
AAGTCCCTGAGGACCTAAAAAAGAAAACTTTACACGACTACITCrTTTTAGATTCT 

CCrCTATCGTTrGGCGGAGGAATTGCATTTTGATAGCAATCGTTTCGCAAGTGACCAAGAGATTGGCAATCTATCA 
GGGGGCGAAGCTTTGAAAATTCAGCITATCCATGAGTTAGCCAAACCCITTG 

CAAATGACCTAGACCTTGAGACAGTTGATTGGCTAAAAGGCCAGATTCAAAAGACCAGGCAAACCGTTATTTTCA 
TTTCCCATGATGAAGACTTTCITTCTGAAACGGCAGACACrATTGTTCACT^ 

AGCGGAAACGCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAAGGCTAATTTTGCCAAACAAAG 
TC AG C AAGCTG CT A AC A A CC AAAG AG CCT ACG AT AA AACC ATG G AAAAAC ATCGG AG AGTT AAG C AAA ATGT AG 
AAACTGCGCTTCGAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCTCTCAC 
AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACHTGA^ 

TT TCAG ACATCCAACCATTACCAGCTTCTAAAGTCTTAGTCCAACTGGAAAAAGAAAATTTGTCCATTGACGACCG 
AGTT I TGGTTCAAAAACT AC AACTAACTGTCCGTG G CC A AG AAAAAATCG GT ATT ATCGG 

GCC AAAT GGTGTTGGGAAATCAACTCTGTTAGCCAAGTTACAGAGACTTCTGAATGATAAAAGAGAGATTTCACT 
TGGTTTTATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCTATCTCAGTAAAACTGGG 
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GAAAAAGAGGAACTACAGAAAATCCAATCTCACCTAGCTAGTCTCAATTTCAGTTATCCAGAAATGC AGCAT CAA 

ATTCGCTCCTTATCTGGCGGACAACAGGGAAAACTCCTGCTTTTGGATTTAGTCCTGCG 

TGCTGGATGAACCCACACGAAACnTTTCTCCCACHTCTCAACCCCAAATCAGAAAACT 

CGGTCTCATCACTGTTTCGCATGACCGTCGTTTCTTAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
5 GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAACCAAAAACATTTTACAACTTGCITGCCGAGCAGAATCTTCCACTTTCGGACCAGCAAAAA 
GAACGTTATTTTGAGCTCTTGGTCGAGTGGAATGAGAAGATTAATTTGACGGCGATTACGGACAAGGAAGAAGTT 

10 TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTCAAGGTTTGATTCCCAATGAAACT 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGA 
TCACTCAATAAGCGCATCAACTTCCTACAACTCTTGGCTCAAGAACTGGATTTGAACGGAGT^ 
GACGTGCCGAAGATTTTGCCCAAGACAAGAACTTCCGTGCTCAATATGATTTTGTAACAGCTCGTGCGGTTGC 
TATGCAGGTCCTATCTGAATTGACTATTCCCTACCTTAAGGTTGGTGGCAAACTATTAGCACTCAAGGCT 

15 GCGCCTGAGGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTTAGTAAGGTCGAAGACAATCTC 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 ATGAGTATTAAACTAATTGCCGTTGATATCGACGGAACCCTTGTCAACAGCCAAAAGGAAATCACTCCTGAAGT^ 
TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACTGGCCGCCCTATCGCAGGCGTTGC 
CAAACTTCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGTAACCTTCAACGGTGCCCTTGTCCAAGA 
AACTGCTACAGGACATGAGATTATCAGCGAATCCTTGACTTATGAGGATTATCTAGATATGGAATTCCT 
AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACGGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

25 GTACACGAATCAACCCTCGTCAGCATGCCTATCnTCTACCGTACCCCTGAAG 

AATGTATGTTTATCG ATG AACC AG AAATTCTCG ATG CTG CG ATTG AAAAAATTCC AG C AG AATTTTACG AG CG CT A 
CTCCATCAACAAATCTGCTCCTTTCTACCTCGAACTCCTTAAAAAGAATGTAGACAAGOT 

TTGGCTGAAAAACTCGGATTGACCAAAGATGAAACCATGGCAATCGGTGATGAAGAAAATGACCGTGCCATGCTG 
GAAGTCGTTGGAAACCCCGTTGTCATGGAAAATGGAAATCCAGAAATCAAAAAAATCGCCAAATACATCACCAAA 
30 ACAAATGACGAATCCGGCGTTGCCCATGCCATCCGAACATGGGTACTGTAA 

4179.7 

ATGACTTGGATTATTCTTGGAGTTATCGCTCTTATTGTTATTTT^ 

TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCGCAATGACCTCTTGCCAAACTTG 
35 ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTA ACCAA GTG 
GCGGCAGCGACTTCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCTCACTCGTCAGGTTTCAGGTAi TI I iGCAG 
TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACTTTC 

ATAAAATTTCrrACTCTCGTCAACTCTATAACAGTGTTGTCAGCAACrACAATGTAAAATT 
CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTC 
40 AAAGTTGATTTTAGCGGTTTAGGTGACTAA 

4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGTATTTTTCCT 
TGTTGGTTATGCGGTTGGTTATCTCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACT^ 
45 TCTACGCTTTGTCTATGATTTTTCAATCGACAGAGATTGTCATGTCCATGAATGGAGCGCGTGAGGTG GATGA GCA 
AACGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 
ATTGATGATCCAGCCTTAAATGCCTTTGCGACAGGTTCTAATCCTCAAAATGCGGCT 

TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCTTTCTAGTATGGCAGGTCGTATGATGTGGT 
50 GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGGTCTTGAAATCATTATGCTAGTGGTTT 
CCCTACTAGCTATTGTACTGGCACCTCTCGCTGCAACCTTGGTTCAGCT^ 

GCAGATGCATCTAGTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTATGAGTCGCCACGTCGATGATGCTAGCAGTGCCCTTTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTCTTTTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTAA 

55 

4179.9 

ATGAAATTAAATATTCAAGAAATTCGTAAGCACrrCTGAAGGTTTGAACTITGAACAAACGT^ 
ACCTG CGTG C ACGT A ATC AAG AAATTTT AG ATGT AAAAG ATATCCTTGCAGTTGGG AAAGT ACA AT ATG AAG A CC 
GTATGTATTTCTTAGATTATCAACTATCTTATACCATTGTTCTTGCTTCGAGTCGC^ 
60 GTTGAATCTTATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATGATG 
ACTTGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACATT 
TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 
GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAGAAAACAGTCCTTTTGCTGGCTTACAAGGACTATTTGA 
CGGAGATGAATAA 

65 
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ATGGAGTTATTTATGAAAATCACAAACTATGAAATCT^ 

TGAAAGTGCTAGAATACGGTGAAAATGTTGATCAGGAGCnTTTGTTGGGTGATATTGCAGATATCTCAGGTTGCCG 

TAATCCAGCCGTTTTTATGGAACGTTATTTTCAGATAGACGATGCGCATTTGTCGAAAGAGTTTC^ 

5 TCTTTCTCTATTTTAGATGACTGTTATCCrTGGGATTTGAGTGAAATA^ 

GGGAAATCTTGACCTCCTGAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGGGAGCTAA 

GTCAGTTGAAAAAGTCATTCAAGGCHTGGAAAATC 

AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTTTA 
TCCT AAAGCC AAT AAACG CTTG C AAG ACT AC ATCG G C AATG ACCATCTG GTTCT AAGTG AAT ATGG ACCTGGTG A 
10 ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCTGGACn^ 

GCTAAGATGCGTTCAGGTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCITrGCTATTCCTG 

GTAGCATTTTAGATGGACrATCAGACGGTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGGC 

AAGATGTTCTTGCGGAATTTGAATTTTAA 

15 4181.1 

ATG AAACGTCAATT AGCCTTGGTCGTCTTTAGTGGTGGTCAAG ATTCAACAACCTGCC 1" I r l CTGGGTCATGCAAC 
ACTATGAAACAGTCGAAGCTGTCACCnTTGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACT 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCTGGGACAAATCACTGCTCAGCCAGACT 
GCGACGATTCATATTTCCrACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCT 

20 ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGGGAAAGACTTGGTCAACTTGCTAGACCCTC 
GCTATTTAGAAGTCrGGGGAAAATTCACrCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACGGTAAGCA 
AGGAACTAAGTATGAGGGCnTGGCAGAACAACGCCTCTTCCAACACGACCTTTATCCAGAGAAAATTGACAACCG 

CTAA 



ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCrTGATATCCrCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACrrGAAAAAACT 
GTAG AG AAAACGG ATG CTTTGGCAACAAATG AT AC AGT AGTTCTAGGTACG ATATCTACAAGTAATTC AG CG AGT 
TCAACTAGTTTGTCAGCtTCAGAGTCGGCAAGTACATCTXjCATCTGAGTCAGCCTCAACCA 
30 CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCrrCGACAAGTATTTCTGCATCATCTACTGTGGTAGGTTC 
ACAAACAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAAATGTCAATCTCCAATCTTATGCrAAGCGACGCAAGCGTTCAGTGGATTCCATCGAGCAATTG 
CTGGCTTCTATAAAAAATGCTGCTGTrriTTCTGGCAATACGATTGTAAATGGCGCCCCT 

TAAACATTGCTAAAAGTGAGACAAAAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTACTA 
35 TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCTTTACCrATACGGTTACGTATGTGAATCCTAAAACAAAT 
GATCTTGGTAATATATCAAGTATGCGTCCnXjGATATTCTATCTATAATrCAGGTACTTGAACACAAACAATG 
CCCTTGGCAGTGATCTTGGTAAACCirCAGGTGTAAAGAACTACATTACTGACAAAAATGGTAGACAGGTTCT 
CTATAATACATCTACAATGACGACGCAGGGTAGTGGGTATACTTGGGGAAATGGTGCCCAAATGAATGG ITTCI 1 1 
GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

40 TACATCCITTACATTrACCCCTTACGCTGCTAGAACAGATAGAAT^ 

GTAGTTGAATCTAGCACGACCAGTCAGTCACTTTCACAGTCTAAGTCACTCTCAGTAAGTGCTAGTCAAAGCGCCT 
CAGCTTCAGCATCAACAAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 
CTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCCTCGGCTTC 
AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCrrCAGCAAGTACCAGTGCTrCAGCrTCAGC 

45 ATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCITCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG 
CACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCnTCGGCTTCAGCAAGCAC 
AAGCGCCTCXfGGTTCAGCATCAACGAGTACGTCAGCITCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCrCAGCrTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

50 AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC 
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCITCAGCATCAACCAGTGCCTCGGCrTCAGCGTCA 
ACCAGTGCGTCAGCITC^GCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACCAGCGCCrCGGCCrCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

55 TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCXJCCTCAGCCTCAGCATCAACGAGTGCT^ 

GCTTCAGCAAGCACAAGCGCCTCGGGTT<^GCATCAACGAGTACGTCAGCnTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCITCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

60 CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT 
CGGCTTCAGCATCAACCAGTGCCTCGGCnTCAGCGTCAACCAGTGCGTCAGCITCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

65 ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCrTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT 
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AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT 
GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGTGCG 
TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 
GCnTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCT^ 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCrrTCTGAATCTG 
CATCAACCAGTGGGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 
CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 
CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 
1 0 TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 



tlOJ.l 

ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCTGGCTGGCCATCGGGCTTGG TTCCG TTTGG 

ACATGACTGCTGCTAATGGCGGTGGAGGCTTTTTACTAATCT 
15 CTGCTGGCTGAGTTrGCCCTTGGCCGTAGTGCTGGCGTTTCCGCTATCAAAAC CTTTGGA AAACT 
ACAAGTACAACTTTATCGGTTGGATTGGCGCaTTGCCCTCnTTATCCT 

TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAACTTGGTGGAACGGGTGATTATGCTCAGTTAT 
TTACrTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCTITATCCTA 
TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATC^ 
20 TCATCGGTCGCTCTCTCAGTTTGCCAAATGCCATGGAAGGGGTTCTTTACTTCCTCAAACCAGAcrr 
GACTAGCACTGGTCTCCTCTATGCTCTGGGACAATCTTTCTTTGCCCTCTCACTAGGGCT^ 
ATGCTTCITACITAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCrCCATCGTAG 

CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCTTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 
TTTATCGTCTTGCCTCAACTCTTTGACAAGATGCCTTTTGGAACCATTTTCTACGTCCT 

25 TTTGCGACAGTCACTTTTTCTGTCGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAAAC 
GTGCCAAATGGAGTGTTATTTTAGGAATTTTGACCTTTGTCT 
GCGGATGTTCACATTTTTGGTAAGACCTTCITrGACGCTATGGACTTCT^ 
AGCTCTCTACCTTTCACTITrrTACAGGCTATATCTTTAAAAAGGCTCTTGCAATGGA 
AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTCCnTCnTCGTTTCTTCGTTTCGTC 

30 ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 



ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCITGCTC^ 

TTCAGGATATGTGTATAATATTCTCAGTCTACTCTTAACACCTCTTGGTTTGCAGGCCTTTG 
35 TCGGTCTCTGGTGTATGGCTGCGCCCATTGCTGCCATCTTTGTTCCGAGAGTCGGAAGTGCA ACGAT TGGAGAA 
GCTAGCTGCGCrTGCTGAAGTCCTTTATGGTAGCCAATTTGGTCTAGGAGCTCTTT^ 
TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCTAACTGCT 
GATTACGCTTGTTAGCITTGTCTATGAATACATTAAGTTAGGTTACTACGCCTT^ 
GCTTGTGGTACGTTTTATTTCTGTTTATTTCTTCTGTACCATCTTGGTTCGTGCCATTGTC 
40 TGCAACTGGAGGAAAAGCATAG 



tlOJ.U _____ 

ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCTTC-TGCTGATTTTATC 
TTCGATTGCTCTGACTCTTTCGGTAGTCGCATTTTGTATTCTCTTTATGCT 
45 CITGGATGATCATACTTGCCATTTTACCATCTTTTGCCAACTACTGGGCAG^ 
GGC^GTCATGCTTGGAACGAGGGCCTTTGTGAGAGTTTG 

AGCTTCTCTTGTACTTGGCTCAAAAGGGGCTATCACGCTCTTGGTCCTATGCCnTGATTGTGGT^ 
CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAG AAGCTTGCCTATTACGTGGTCAAGAACTACA 1 r i 1GGTCGC 
CCTTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCTGAGA 
50 ATATGACGAACATGCACAGTTGAAGAATAGCTATCGGACTTTTTATATTCCTAAAAA^ 
CTTTTCTTTTTATTGCTTCAAACCAGTCTATTTTTATAA 



tlOJ./ 

ATG AG AAAG C ACC AATT AC AAGTTC A C A A ATT A ACC ATTTT ATCT ATG ATG ATTG CCCTTG ATGT AGTCCTT AC AC 
55 CTATCTTTCGAATTGAGGGAATGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATC ATGA TGGGACCTGT 
TTATGCCTTGGCTATGGCTACAGTCACAGCCm 
GGAGCGACTTTTGGAGCCCTTCTAGCAGGTCTCTTT^ 

AGATrrrGGGAACAGGTATTArrGGTTCCATTGTTTCCTATCCTGTTATGGTACT 
CTTAGCrGGTTTATCTACACGCCTCGATTT^ 
60 ATTTTT AATC A AG C AGG AATTCTTTA AAAA AGTG C AGGG AT ATTTCTTT AGTG A AAGG AT AG A CTG A 

4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCACTGCATTA 
GATGCTGGCAAATGGGATTTTGGCTCTGGGATGCAAACCTGTCATC^ 
65 ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCA 
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GCTTCGTATGCAAACCAATCTTCTCTCCCGATC 

GCTTAGTTAAAGACCTTTTAGACTATAGACCTACGGTCCTTAAAGGAAACATGTCAGAAAT^ 

ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 

AGACTGGTGTCAGACCTATCCTGGTATGTCTTTCTTAG 

GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACITGGTTGGAGCCTTAAC 
GCTGTTTTTCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCTTAG 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCTTTCGCTC^ 
AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 

AAAAATCAAGGGGCCAGTGAATATTGGCTTCrGGGAGATATTTTTCTTCCTGGTC 

CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGAGGCAATTGGGATGATCGTGTC 

GCAATATGGCTTAGAAGACCCACAGGAAGTTCAGCTCTTGCGTATGACACAGTATTTGATGGA GCGAA TGGATCC 
TGCAACGATTGTCTGGCTACGAAGCITGCCTTTGCnXjGAAAAGAAAGAA 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCTAGTTGAGAATGATACAGAGAAATTTGACCAACTGCTA 

GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAGTTGCTTCGTTATGGAAGTCAAGGGCAACAA 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTCCCAGTATGCCG 

TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAGTTAGA 

ATTGGCC A AGTCC AAGGGG CTTC C CTTT ATCG AAATGT ATG AAG AACTG CGTCGTG ACG A T AACTAT C A GGG G C A 

CAATCTG G A ATT ATTAG CC AG CTT AAT AG A A AAG C ATG G GT ATGT AG AGG ATGTG A AG AA' I * I ' n " 1"! TG ATTTTTTG 

TAA 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATTAAATGAAAC 
TTGAAACCCTTGGAATGAAGGCCITGTTAGAAGAATCGGCCT^ 

GCTGGTTTrAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTGT 
CAAGGTGGAAAATCCCTTAGAAATTGAAGGAATCTTATCTAAAACAGATTCGATTCATCGATTATATAAAGGT 
AAATGCCTACGCnTTTGAAATTTTCTCACCAGAAGATGATTTGATTTTGATTCATGCGGAAGATGAC 
CTAGTAGAAGTAGG AGAAAAGCCTGAATTTCAAACAGATTTGGCATCAA 1'1'1'Cl' 1 ' 1 AAGTAAATTTGAG ATTTCTA 
TG GAATT AC ATCTCCC AACTG AT ATCG AAAGTTTCTTGG AATC ATCTG AAATTG G GG CATCCCTTG ATTTT ATTCC 
AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACre^ 

TTAGACATAGCAAGTCnTCGCCAGAAGTTTGAGTCTACTGAAT ATTTT ATTCCTAAGTCTGAAAAAT^ 
GT AA AG AT AG AAAT AATGTTG AATTGTG GTTTG AAG AAG T ATG A 

4186.2 

ATGAAGTGGACCAAGATTATTAAAAAAATAGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTTGCG 

T ATTTT AAGG AC AATC A ATG G AC AG AGTTCT ATTT AGG CC AG AGTG ACCC AG AG C ATG G CTTG C AG ACTG AGG C A 

GGACTAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCTTCrTGTC 

GTCAATTAGATATTGATAGACTGGTAATAGATTITTTACCTGAGAGTGATTATCCAGACATCACTATTCGCCAGCT 

CTTG ACTC ATG C AAC AG A CCTTG ATCCTTTT ATTCCTA ATCGTG ATCTTTT AAC AG CC CCTG A ATT A AAGGAAGCG 

ATGTTrCATCTCAACAGACGAAGTCAGCCAGCCTTTCTTTATTCGG 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTTTAAAGGATCAAGTCTGGAAACC^ 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 
TCGTCTCCTCGGTAGACATGCTGGGAGTGCTGGTTTATT^ 

TATTT AGC AG ATG ATTTTG C AAG AG ACTTAAATCA AA ATTTTTCTCCTTTG G ATG AC AAGG AACGTTCTTT AGC AT 
GGAATTTGGAAGGAGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAGACTTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATA 

TACTCCTCAAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCnTGGACGCCTGG'rrrrrClTTrG 

ACTCCTTTCAATTCTCTCTGGAAACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGA T 

TCCTCAATGTCTrCITGTTTTTTCCTCTGATTTTCCAACTCCT 

GTCCnTCTTTTTAGTTTTCTTGTGAGTCTTGGAATCGAGTGTACGCAATTAATCT^ 

CGCGTCTTTGAGATTGATGATTTGTGGACCAACACmGGGTGGCTATCTGGCTTGGCT^ 

ATAAAAACAAGGTAAGGAATTAA 
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ATG A AG ATTCCTCTCTTA ACTTTTG C AAGG C AT AAATTTGTTT ATGTCTTG CTT ACTTTG CTTTTTCTTG CTTTGGTT 
TATCGTGATGTTTTG ATGACTTA rri ' C ' l ' IT 1 ' i ' 1 G ATATTCATGCGCCCG ATCTAGCT AAATTC G ATGG ACAAGCAAT 
TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATTCTCC^GTTCAATCTAGCT 
5 TTCCAATCATCATTGTTTTGCTAGGTTTTCAATATATTGAGCTGAAAAATAAAGTTIT 
AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGCAAGTATC^ 
ACTGTGCTGATAATTGCAATTATAACCTATTTCTTTGGGACTTTTTCTCCT 
GGAAGTGGTTTACAAAGACTCCTAGATGGAGAGATAAAAAGCTATTTGTTCTTTACTTGTGTC 
TTTTCATCAATGCAATCTATTTrTTACAAATAGTTGATTATGTGGGGAATGT^ 
10 ATG TTTCTTT GGCTTG<nTCTATGCTGCTTTATAGTGCCTTGCCTTACT 
AGCTAGCTATGGGGATG TAAG TTTGATGAAACTCTTTACTCCTTATATC 
AAAAATATGAAGATAATGTTTAA 

4188.2 

15 ATGAAGATAATGTTTAAGAATTTTAACAATATTTTGCTAAATAGAAAGATTGTTT^ 

GATG ATTTTGATAAACCATCTATTGTCAACAGCGGTTCAAAAGCAGG ATGCTGTTATci tTl i CA AGAG A G AATTG 
ATTTCAATTTTTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCAA 

CATGGTAGGATGGCTCTCTGTCATTTTACTTGAAAGTGATTTGGCAGACCATTACCATCACTTGATTCGCT 
TCAAGCTCCTTTTTCGATTATACAAGGAAACGATTGGTTGTCATTTCTAAATTTTTT 
20 GTTTCTTGGTTTACTTCCTCTAGGAATTCATTTCAAAACAGTCGCACTnTC IT I' TT ACTTG CTCA GTTAAT G ATGTT 

GTACTTACTACTGTCTTATCTG ATAGC ACTG ATTAGTGCG G GCGCTGGTTTTTCCTTTTTTCTCTA 1 1 i 1 1 i AGCATT 
TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGTTGAT^ 

GTT AGTCG CTTGG A AG AG AAATTT AAG AAAGG AT AA 
25 4L88.5 

ATGGGC AAAGG AG AG ATGGGC AAAGG AGTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCCTTCAATTGGCAAATGGTAAAACAGCGACTITCCTAACCCAGTATGATAGCAAGACCTTGTTGm 
ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 
30 GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTAC 
TACAAAAAAAGATTATACTTACAAAGCTCCTCTTGCTCAGCAGGCACTTCCTGAAACAGGAAACAAGGAGAGTGA 
CCTCCTAGCrTCACTAGGACrAACAGCTTTCTTCCTTGGTCTGTTTACGCT 

4188.10 

35 ATGTTTAAAGTITTACAAAAAGTTGGAAAAGCTTTT ATGTT ACCTATAGCTATACTTCCT 

GGGGArTGGTGGTGCACTTTCAAACCCAACCACGATAGCAACTTATCCAATACTAGACAATAGTAi i I nCAATCA 
ATATTCCAAGTAATGAGCTCTGCAGGAGAGGTTGTATTCAGTAATTTGTCACTACTTCTCTGTGTGGGATTATGTA 
TTGGCITAGCGAAACGAGAT AAAGG AACCGCTGCGTTAGCAGGAGTAACTGGTTACnTAGTTATGACTGCAACGA 
TCAAAGCTTTGGTAAAACITTTTATGGCAGAAGGATCTGCAATTGATACTGGAGTTATTGGAG 
40 AATAGTrGCCGTATATTTGCACAACCGATATAACAATATTCAATTACCTTCCGCTTTAGGATTCTTTG GA 
CGCTTCGTTCCTATTGTTACATCGTrCTCTTCTATCTTGATTGGCTTTGTCTTCT 

CAACTTCTTGTITCTACAGGTGGATATATTrCTCAGGCGGGTCCAATTGGAACTTrTCTATATGGAi ri l i AATGAG 
ACTTTCTGG AGC AGT AGG CTTACATCATAT AATTTA CCCTATGTTTTGGTATACTG AACTTGGTGGTGTTG A A ACTG 
TTGCAGGACAAACAGTGGTTGGAGCTCAAAAAATATTTTTTGCTCAArrAGCCGATTrGGCC 

45 TACAGAACGAACAAGGTTTTTTGCAGGTCGTTTCTCAACA 

TACCATAGTGTTCCTAAAAATCGTCGTA AAAAATACGCGGGTTTGTTITTTGG AGTTGCT^ ITT l ATTAC 

CGGTATTACAGAACCAATTGAATTTATGTITCrATTC^ 

GTGTTAGCTTCnTTATTGCAGACGTCTTAAATATTTCAATAGGAAACACATrTTCAGGA 
TTATTTGGAATTTTGCAGGGGAACGCTAAGACGAATTGGGTTCrrCAGATTCCATTrGG 

50 GTATTATATTATrTTTAGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAGAAGTAGA 

AAAGAAATTTCTGAATCCGCAGATTCAACTTCAAATACTGCAGATTATTTAAAACAGGATAGCCTACAAATTATCA 

GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCTTGTGTGACACGTTTACGTGTAGCTGTAAAAGAAG 

TTAATCAAGTTGATAAAGCACTTTTAAAACAAATTGGTGCAGTTGATGTCTTAGAAGTGAAGGGTGGCATTCA^ 

^ AATCTATGGAGCAAAAGCAATCTTATATAAAAATAGTATTAATGAAATTTTAGGTGTAGATGATTAA 

4188.11 

ATGAAATTTAGAAAATTAGCTTGTACAGTACTTGCGGGTGCTGGGGTTCTTGGTCITGCT 

GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 
TTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCATCGAAGCGTTTGAAAAAGCAAACC 

60 CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 
GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 
TGAATGACCTCTTCACAGATGAATTTGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 
ACAAGGCTTATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 
AGTAGCAAACCTTGTAAAAGAAGGTTCGACAACTGATGATTTTGAAAAAGTATTGAA AGCA CTTAAAGACAAGGG 

65 TTACACACCAGGTTCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACAra 
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AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAA 
GCAACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCAC^ 

GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACTTTTA^ 
AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGAGTACCTTGTAAACGGG 
5 TTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATTCATCCAGTTTATC GCAG ATGAC 
AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGTCCGTACTTCATTTGGAAAACTTTATG 
ACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAAC^ 
AAATGAGAACACTTTGGTTCCCAATGTTGCAATCTGTATCAAATGGTGACGAAAA^^ 
CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 

10 

4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACTGTTATTTCAACAATCATTTTGCTCTTGTTGACTG^ 
TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAAT^ 
TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACTCATGGTGCAGAACCCTGCCTTGCAATC 
15 AACTCAGTATTTATCTCATTGGTAACCATGTTCTTAGTTTGT 

AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCTTTATCGCTGCTATGGCGCT^ 
TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTC 

GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTTGAATCAGCTAAAATCGAC 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTGAAACCAGGGTTTGCAGCC 
20 TTACCTTCATCAATACTTGGAATGACTACTTCATGCAATTGGTAATGTTGACTTCACGTAACAATTT^ 

CTTGG G GTTG CG A CC ATG CAGGCTG A A ATGG C AACC A ACT ATGGTTTG ATT ATGG C AGG AG CTG CCCTTG CTG CTG 
TTCCAATCGTCACAGTCTTCCTAGTCTITCCAAAAATCCrrTCACACAGGGTATTACTATGGGAGCGGTCAAAGGATA 
A 

25 4191.1 

ATGAAAAAAA LTriTnUlT ACTGCTCTTAGGClTG^ 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGCAGAGTTTAGACAGAAGATAGTTTACCAGT 
TTGAGGAGGACTTTAAGGGCCAAATCGTGGGACnTGGACGTGCTGGTAAGATGCCTAGCGGGTTTGACATTGACC 
CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 
30 GATGGTTATACrGTGAGAGTCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCTGGAACTTA 
AAAAATTTACITTTCCTTTATGATGATATCGCTGAATTAAATTGGCAACCTCT 
AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACrCTrm 

AGGGAACGATTGAAAAGAGTAACCnTGATTATACrATCCGTTTAGACAATCrrCCGGCTAAGCGTGGAGTT^ 
GCATGCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGGGATCAGGGATC^ 
35 AATAAGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCTTCGATC 
CTTTCCATCTCCTTGTTATTGAGTGTCTGCTTCTATTTTATTTATAGAAGAAAGACCACTCCT 

C AAAA ATC ATCGTCTCT ATG AA C C ACC A ATGG A ATT AG AGCCTATG GTTTT ATC AG AAG CAGTCT ACTCG ACCTCC 
TTGGAGGAAGTGAGTCCCTTGGTCAAGGGAGCrGGAAAATTCACCTTTGATCAACnTATTCAAGCTA^ 
ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTA GTAA AAGAAGATG 
40 GTTrGTCAAGCITTGAGAAAGACTGCCTAAATCTAGCTrTTTCAGGTAA^ 

TGCGGATTACAAGGTATCTGATAGTCTTTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 
GCTTCAACTCAAATCTTCTTTTGAAGAGGTATTGAACCAGATGCAAGAAGGA 

GGGCTCCCAGATTATTATCGTCCTTTAACTGGTGGGGAAAAGGCCTTGCAAGTGGGTATGGGTG CCTTG ACTATCC 
TGCCCCTATTTATCGGATTTGGTTTGTTCTTGTACAGTTTAGACGTTCATGGCT^ 
45 CTTGGTTTTCTAGGGTTAGTTTTGTCTGTTTTCTATTATTGGAAGCTTCGA 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGA 
GCTGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCrTGGTCTATGCG 

GTCATTTGATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
TACGTTTTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCGCAAATACAGCAAGCA CCTAC TCTGTATCT 
50 TCTGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCTTTTAA 

4191.2 

ATGAAAAAAGTAAGAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAGCrrrri'CrT 
CGATAGTTGCTTTAGCAGAAACGCCTGAAACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGAGACAGGCGAAG 

55 GAGGAGCGCnTCTAGGAGATGCCGTCTTTGAGTTGAAAAACAATACGGA 

AGGCGCAAACAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACAGAAGCCCAACCTCCAG 
TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATGGTGGGACGACTGTCCAAGGTGAAC 
AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 
ATCAGATTATTAAGGTAGATGGTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

60 TGATTCCAGAAGGTACACTTTCAAAGAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGGAATCGAATT 
GACGGTTAGTGGGAAAACAGTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATGTCGTTATCTTGCTCGATAA 
CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 
TATTGATAAAATTACATCTGATTCAGAAAATAGGGTAGCGCTTGTGACTTATGCTTCCACTATCTTTGATGGGACC 
GAGTTTACAGTAG AAAAAGGGGTAGCAG ATAA AAACGGAAAGCG ATTG AATG ATTCTC IT IT ITGG AATTATGAT 

65 CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGATAAGAATGACATTC 
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TAAAAAATAAGGTACCTACCGAGGCAGAAGACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCA uri i lA 
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTCACACAACAAGCG 

ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTACG TTTGC TCCATCATATCAAAAT 
CAACTAAATGCATTTTTTAGTAAATCTCCTAATAAAGATGGAATACTATTAA 
5 GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAGATGTTTACAGATAAGACAGTTTATGAAAAAG 
GTGCTCCTGCAGCTTTCCCAGTTAAACCTGAAAAATATTCTGAAATC 

TCCAATTAATGGTGGATATATTTGGCTTAATTGGAGAGAGAGTATTCTGGCnTATCCGTTTAATTCT 

AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTC CTGA TGGGTATGATGTCTTTAC 

GGTAGGTATTGGTATTAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTAT^ 

10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACACCATC 
GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACrGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTGTACCTTGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAATGATGAGTTTG 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACnTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTC AAAAG AGAAAAAACTTGGTG 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCTTTAGTCTTCAAAAACAAC 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACTTATCAAAATGTGAGAACAGGTGAAGATG 
GTAAGTTGACCITTAAAAATCTGTCAGATGGGAAATATCGATTATTTGAAAATTCTGAACCAGCT 

20 CGTTCAAAATAAGCCTATCGTTGCCTTCCAAATAGTAAATGGAGAAGTCAGAGATGTGACITCAATCGTTCCAC 
GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCTCGAACTGGTGGTATCGGAATGTTGCCATTCTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 

25 

4191.3 

ATGAAATCAATCAACAAATTTTTAACAATGCnTGCTGCCTTATTACTGACAGCGAG 

CAGTTTTTGCGGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGGGATATGGATA 
30 AAATTGCAAATGAGTTAGAAACAGGTAACTATGCTGGTAATAAAGTGGGTGTTCTACCTGCAAATGCAAAAGAAA 
TTGCCGGTGTTATGTTCGTTTGGACAAATACTAATAATGAAATTATTGATGAAAATGGCCAAACTCTAGGAGTGAA 
TATTGATCCACAAACATTTAAACTCTCAGGGGCAATGCCGGCAACTGCAATGAAAAAATTAACAGAAGCTGAAGG 
AGCTAAATTTAACACGGCAAATTTACCAGCTGCTAAGTATAAAATTTAT 

GGTG A AG ATGG AG CAACCTT AAC AGGTTCT AAAGC AGTTCCAATTG AAATTG AATT ACC ATTG AA CG ATGTTGTG 

35 G ATG CG C ATGTGT ATCC A A AAAAT AC AG AAG C AAAG CC AAAAATTG AT AAAGATTTCAAAGGT AAAGC AAATC C A 

GATACACCACGTGTAGATAAAGATACACCTGTGAACCACCAAGTTGGAGATGTTGTAGAGTACGAAATTGTTACA 
AAAATTCCAGCACTTGCTAATTATGCAACAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCA^ 
. GGTACAGTGAAAGTAACTGTTGATGATGTTGCACTTGAAGCAGGTC 
TTGATTTGAAATTAACAGATGCTGGTTTAGCTAAAGTGAATGACCAAAACGCTGAAAAAACTGTGAAAATC 

40 ATTCGGCAACATTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA 
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGG 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACITTGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATATAA 
ATTCGTTGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

45 GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGGTAAAAAGTTTGT 
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAAATGCTGATAATGCTC 
TT AG C ACGT AAAG C AG AT A AAGTG AGTC A AG AAG AG AAGCAGTTGGTTGTT AC AAC AA AG G ATG CTTT AG AT AG A 
GCAGTTGCTGCTTATAACGCTCnTACTGCACAACAAGAAACTCAGCAAGAAAAAGAGAAAGTTGACAAAGCTCAA 
GCTGCTTATAATGCTGCTGTGATTGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTT^ 

50 TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAATTACAGGCCnTCnTGCAGGTACATATTACriTA 
AAAACAGCCrGCTGGTTATGCATTACTAACTAGCCGTCAGAAATTTGAAGTCACTGCAACTTCn^ 
GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC 
CC AC A A ACGGGTGGT ATTG GT A C A ATT ATCTTTGCTGT AG CGGGGGCTGCG ATT ATGGGT ATTG C A GTGT A CG CAT 

^ ATGTTAAAAACAACAAAGATGAGGATCAACTTGCTTAA . 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTATCITVIUI'GTTATGGCTCTGTGTTTTTCTCTTGTATGGGG 
TGC AC ATGC AGTCC A AGCGC AAG A AG ATC AC ACGTTGGTCTTG C AATTGG AG AACTATC A G G AGGTG GTT AGTCA 
ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

60 ATTGTAAGAGACITGCATTCGTGGGATGAGAATAAACriTCrTCm 

TTGAGAATCAGATTGAAGTATCTCATATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TTCTTATCCAGCTGAA rnC r r il 1GAAATGACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAAACA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCTTGATTGGAGAATACCGTTACAGTTCTTCTGGTC 

65 AAGT AGGG AG AACTCTCT AT ACTG AT A AAA ATG G AG AG ATTITTGTG AC AAATCITCCTCTTGG G AACT ATCGTTT 
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CAAGGAGGTGGAGCCACTGGCAGGCTATGCTGTTACGACGCTGGATACGGATGTCCAGCTGGTAGATCATCAGCT 
GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACnTATGAAGGTGGATGGTCGGACCAA 
TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATA 

AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 
TGGGAGCTCCAAGCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACA^ 

AGGAACTGGTAACAGTGGTTA AAAA TAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG 
TATATCTTGATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTGATGCTAGGACTTGTCAATGTTGCT 
AATATTTACGCATCGGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 
TCAAAATCGATGGGACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 
GTAAAGAACCTTTGGTTGTTAAAACCAAGTGGGAAGGTCTAGTCCCTGCC 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 
AGTTTTGCTTGTCAAAAAAGATTCTGCCTACGCAAGTGCTAAATCTTTGGATGACTTTAACG 
TCTCAACAAGGGGTCTACCTITATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGG 
TTCGCTCAAATGCGACAAGCTCITGAGGCTGGTGTCATTG 

CTGAAGCTGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCTGGTTTCAAAACT 

TCGCTATCGGGC1TCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAAAGATG 

ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTA GCAGTA 

GTTTCITTAGCCAAGTTGCTAAAATTCTTTCTGAAAACTGGCAACAACTCTTGCGTGG 

ATCTCTATCGTCG G AACCATCAT AGGTCTCATTATTGG A CTTGCC ATTGGTGTCTTCCGTACTGCTCCTCTCTCTG A 
AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAATGTCTACATTGAAATTTTCCGTGGTACG 
CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 

TTGACCGTACACTGGCTGCTATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGACCCATAACCAGACCATGCGTAA 
G ATTGTCCTACCTC AGGT AGTCCGTAAC ATCCTA CCTG C AACTGGT AATG AATTTGTCATCAATATCAAAG ATACA 
TCTGTATTGAACGTTATCTCTGTTGTCGAACTTTATTTCTCAGGAAATACCGTGGCAACACAAACCT 
CC^GACATTTACAATCATCGCCGTGATTTACTTrGTCCTCACCTTCACCGTAACACGTATCCTACGCTTTAT^ 
GC AG AATGG AC ATG G AT ACCTAC ACT AC AGGTGCT AACC AAATG CAAACGG AGG ATTTG AAAT AA 

4191.6 

ATGACACAAGCAATCCnTGAAATTAAACACCTCAAAAAATCCrATGGACAAAACGAAGTGCTAAAAGACATTTCA 
CTC ACTGTCC AC AAG G G AG AG GTC ATCTCT ATC ATCG G AAG CTCTGG AAG CGG AAAATCG ACCTTCCT ACG CTCC 
ATTAACCTACTTGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATGAC 
CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCnTTAACCTCT 

ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 

GAAAAGGTCGGCATGGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 

GGCCATCGCTCGTGCCCTCrCCATGAATCCGGACGCTATTCrCrrTGATGAACCAACATCAGCT 

ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 

ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 

AAGACCTCTTCACCAATCCTAAAGAAGACCGAACAAAAGAGrrCCTTCAACGCTATCTCAAATAA 

4192.3 

ATG AAAAAGT ATC AACTTCT ATTC AAAAT AAGTG C AGTCTTCTCIT ACTT A mTI CGT ATTT AGTCTTTCTC AG CT 

GACGCTTATCGTCCAAAACTATTGGCAATTrrcnTCTCAGATAGGCAATTTA 

TACITITTATTGGACrrCATGATTGTGGTTCITGTTAAGACAGGCCATGGTTATCT 

ATGGCTTTGGTATTCGATTTTGACAGTATTAGTGCTAGTGTrCCAGATCTCTTT^ 

TTCAGTGAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTA^ 

AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATTGATTTATAGAGGATTACrc 

CGCGATTTGGTCTTGATTTGCTTCTTCCTTCrATTTTATTTGCT 

ATATCTTCGTCTTTGCAACAGTTGGAATCATCTTTGCTGGTTTGACCCGCT 

GCGGTGCATGTGATCAATAATATTGTAGCGACCnTCCCGTTTTTGCTCACTTTTCT 

4193.1 

ATGAACAAGAAACAATGGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTTGCTGCATGTGGTAACCGC 
TCHTCTCGTAACGCAGCITCATCnTCT^ 

AC AAATC ATTC AACC A ATC AG CTTGGG AAGGTTTGC AG G CTTGG G GT AA AG AAC AC AATCTTTC A AAAG AT AACG 

GTTTCACrTACrrCCAATCAACAAGTGAAGCrGACTACGCTAACAACHTGCAACAAGCGG 

AATCTTCGGTGTTGGTTTTGCCCITAATAATGCAGTTAAAGATGCAGCAAAAGAACACACT 

rrGATTGATGATGTGATTAAAGACCAAAAGAATGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACcTTG 
CAGGTGTGGCTGCAGCAAAAACAACTAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCT 
CTCGTITTGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATC 

TTCATTTGGTGATGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 
AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 
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AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA 
CTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCT 

TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGCAGTAACAAACCT^ 
AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAGCGTAAAAGT 
5 TCCTGAAAAATAA 



4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTrGATTTCTGTATTCCT^ 

GTGGATCTTCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCTTTGGCAGTCTGCGTGGGA^ 

10 GGAGAAATCTTCCGTGCTATGGGTCCTCTGGTCTTGATTGGTCTTGGTTTTGCC^ 

TAACGTCGG ACTTCCTGGTCAG G CTTTGGC AGGTTGG ATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCAG ATATG 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGGGATTGTCGGAGCGATTCCAGGTA 
TTCITAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA 
TGCCnTTATCCATGCTTrCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATT CGTG 

1 5 TATCAGACACCTTGGTTGGCTGAGTTGACTGGTAACTCACGGATGAATATTGGTAIl'llCl 1 1 GCCATCATTGCCGT 

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTT^^ 

CAGAATATGCTGGTATTTCTGCCAAGCGGACTATTATCCTATCTATGATTATTTCAGGTGCC^ 
GGAGCTGTTGAAGGTTTGGGAACCTTCCAGAACGTCTATGTTCAAGGTTCGTCATTAGCTATCGGATTTAACG 

TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACT 
20 GGGGCTCCTGGTATG AATGCGGCGCAGGTACCATCTG AGCnTGTCAGCATTGTAACAGCGTCTATTATC i i l; inG 

TCAGTGTTCATTACCTTATCGAACGCTTTGTCAAACCGAAAAAACAAGTTAAAGGAGGTAAGTAA 

4194.1 

ATGGGAGTGAAAAAGAAACTAAAGTTGACTAGTTTGCTAGGACTGTCTCTGTTAATCATGA CAGCC TGTGCGACT 
25 AATGGGGTAACTAGCGATATTACAGCCGAATCGGCTGATTTTTGGAGTAAATTGGTTTACT^ 
TTCGCTTTTTATCGTTTGATATTAGTATCGGAGTGGGGATTATTCTCIT^ 

CCAGTCnTTCAGGTGCAAATGGTGGCTTCTAGGAAAATGCAGGAAGCTCAGCCACGCATTAAGGCGCTTCGAGAA 
CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCTTTGGCCGATTTTGATTCAGATGCCGGTTATTTTGGCCCTGTT^ 
30 GAGTTGACTTTTTAAAGACAGGTCATTTCTTATGGATTAAC 

TTAGCAGCAGTATTCACCTTTTTAAGTACITGGTTGTCCAACAAAGCTTT 

CGATGATGTATGGGATTCCAGTCnTGATTTTTATCITTGCAGTTTATGCGCCAGGTGGAGTCGCCCTA 
AGTGTCTAATGCHTATCAAGTCTTGCAAACCTATTT 

GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAAAACGAAATAA 

35 

4194.4 

ATGGTTATCGATCCATTTGCTATCAACGAACTAGACTATTACTTAGTTTCACACITCCACACT 

CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCTTACCACTGTGGACG 

AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 

40 TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCTTGGTAACTCTCCCAGTGAACGGTG CTGAT GAGACA 
GGCGGTGAACTTGCTGGCnTGGCTGTTACAGATGAAGAAATGGCTCAAAAGGCTGTTAACTATATCT^ 
CAGGTGGAACCATCTATCATGGTGCAGATTCTCACrTCTCAAACTATTTTGCAAAACATGGT^ 
TGATGTTGCTTTG AATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATG ACATCTATCGACci TC 1 1 CGT 
ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAACrrCATGGCTTCTACTA 

45 ATG AG ATTCT AG AACTTTGG AAAATGCG AAAAG ATCG CTTG C AAT A CG ATTTCC ATCCATTTATCTG GG AAGTTGG 

CGGTAAGTACACITATCCTCAAGAT CAAC ACTTAGTAGAAT^ 
CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA 



50 

4196.2 

ATGTTCCTTTCAGGCTGGTTGTCTAGTTTTGCr^ 
TCCATTTTTAAATGCCriTGAAAGTGCTATTGCGGCTCCTTTGGT^ 
55 TTGTTTTGGCTTTGATTCCTGTGCGAAAATTAAAATCTTTGTTT^ 

AtGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGCTTTGACTTTACTATTTCGCGA^ 
GTATCATCTCAGGAATTGCCTGTCACTGGAGTTTTTCAGGTCTAGCTGTAGTAGGTGT^ 

T AT AAAGG AC AG A AGGTTGG C AAG AAA C AGGGCCITATTTTTCT AG GTTT AG CCTTG GG AACTCACl TCTI G TTT A 
ACTCTCCTTTTGTGGAGTTGGAAACAGAGTTGCCITTAGCGA 
60 TATCATGCTTATTGCTTTGTTGAGAAACACAATGAGTTGATGACCTAG 

4197.1 

ATGAAGGTGGAACCACGTTGCGACGTCCTTrCGAGGATGTCGCATTTTTTTATTAGGATACTAATTATGGAGTTGC 
AAGAATTAGTGGAGCGCAGTTGGGCAATCCGACAAGCTTATCACGAACTGGAAGTTAAGCATCATGATTCCAAGT 
65 GGACGGTAGAAGAAGACCTCTTGGCTTTATCTAATGATATrGGAAATTTCCAACGACTGGTGATGACAAAGCAAG 
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GACGCTACTATGATGAAACACCCTACACACTGGAACAAAAACTTTCA 

CTCAACGTTTGGATATAGAC^TTCTGACGGAAATGGAAAACTTCCT 

GGACTTGGAAGTAG 

5 4197.4 

ATGCrrGATTGGAAACAATTTTTTCTAGCCTATCTGCGCTCCCGTAGTCGTCTTTTT 

ATTTCrTGTCTTACTCTTTCAGTTTTTATTTGCCAGTCTAGG^ 1 I CI TGTGTTGCTTT 

GTAACCATATTATTTTTCACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 
GGGAAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAACTCTATCAGCAGA 
10 GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGA CCCC CA 
TTGCAGCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 
AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTlTAGAAAGTTTCCATGATGATT^ 
GGTTCAAATTG AGG ACTTGGTC AAGG AAATAATTCGTAAAT ATGCTC IT rTCTTI ATTC AAAAAGGCTTAAATGTC 
AATCTACATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTT ATTGA GCAAATCATCTCA 
15 AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACGTGGCTTTTCAGGATACAATGGCCGTTTGACCC 
AGCAGTCCTCTGGACTTGGCCTTTATCTATCTAAGAAAATTTCTGAAGAACTGGGGCACCAGATTCGTATCGAGTC 
TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 

4211.2- 

ATGGAACTTAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGGATGAACTG 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAGCTTTATCAACACTATGTTGAACCGTAAGAATC 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACrTT'TTTAACATTGATGACAAGATGCGCTTTGT 
GGATGTGCCTGGTTATGGCTATGCTCGTGTTTCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
C AG ATGTA CG AATTTCTC AAGT ATT ATG AG ATTCC AGTC ATC ATTGTGG CG A C C AAGG CG G AC AAG ATTCCTCGTG 
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACCCGAGTGACGATTTCATCCT 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGA AAACTG TAGCCATTCAGTCC 
TTCGAGGATCTAGGTrArTTCACCATTGATAATATGCCGCCAGCTCTCrTGCCTAAGrrTTTGC 
TAAGGAAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCGTTCni C 1 11 1 C AG AG ATTC AAG CTGTT 
TTGG ATG AGTTG G AAA ATC AAG ATGGTTTGG ATTTC AAAATCCTCTTTTTG G ATGCGGCTG AT AAGG AATTGGTCG 
CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC 
GTGAACTCITGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACrCCA CGTG AGcTGC^ 
CAAAACCCTTGCAGAGCAGTTTTCAGACCAAGAACAAGCCCAGTC^ 
AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTC^ 

GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGAC'ri'l'l ATCAA 
CATTTATTGGCCiTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGACCAT^ 
GATGTACGGGTGGACAAC^CCGTAGTGTGGCATTTGCTAAACGCTTGGCGCAGGACITATCCAAGAATTGGTCTGT 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 

4211.4 

45 ATG AG AAAAC C AAAG AT AA CGGTG ATTG GTGG AGGG ACTGG AAGTCCCGTC ATTCT AAAAAG TCTGCG GG AAAAA 

GATGTGGAAATCGCAGCrATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 
C^GTTGAC^CCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAGTTTTATGAGAA 
rrCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCTGGCCATCCATTGGGAAATCTCATCATTGCTGG 
AGAAATGCAGGGTTCAACCrATAATGCCATGCAGTTATTGAGCAAATTTTTCCATACAACAGGGAAAATTTATCCr 

50 TCCAGTGACCATCCnTTGACCCTTCATGCAGTCTTTCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG 
ACCATCGAGGCATAATTGACAATGTCTATGTGACCAATGCCCTAAACGATGATACGCCTCTGGCCAGCCGTCGAG 
TAGTGCAGACCATCCTTGAAAGTGACATGATTGTCCTAGGGCCAGGTTCCCTCTTTACCTCTATTTTGCCCAA 
CGTGATTAAGGAAATTGGGCGGGCTCrTTTGGAAACCAAGGCAGAAATTGCCTATGTCTGCAATATC^ TGACC CA 
ACGTGGGGAGACGGAACACTITACAG ATAGCGACCACGTGGAAGTCTTGCATCGTCACCrTGGTCGCCCl I ri ATC 

55 GACACTGTCTTGGTGAATATTGAAAAAGTGCCTCAGGAATACATGAATTCCAACCGTTTTGATGAATACT^ 
AAGTGGAACACGATTTTGTAGGTCrrTGTAAGCAAGTTTCGCGCGTGATTTCATCTAACITC^ 
TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 

4213.1 

60 ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACTTAGCAGACAGTGTATT^ 
TGTGGCACGTTAGCAATAATTATTCITCGAGCATGTTCTTAGGAATATTTATTGCAGTAAATTA ^ 
TTACTAATCTTTTTTGGACCAGTTATTGACAGAGTAAATCCGCAAAAA 

AGCAGTGGCTGTAAT ATTTTT ATT ATT ATT AAACCAAATATCATTTTGGGTGATAATGAGT^ 

TAATGGCTAGCTCCATAAGTTACGTGATAGAAGATGTGTTGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT 
65 TG C AAATTCTCTITTT AGT ATTTCGT AT AAAGT ATT AG ATTCT ATTTTT AATTC ATTCG C ATC ATTTTT A C AG GTG G 
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C AGTAGG ATTTATTTT ATTGGTT AAG AT AG AT ATAGGC ATATTTTT ACTTG CTCTATTTATATTGTTGTTGTTAAAA 

TTTAGAACTAGCAATGCGAATATAGAAAACrrTCTCTTTCAAATATTACAAGAGAG AAGTGTTG C 

TTT ATTTT AAATAAT AA ATTATTATTTAA AACCAGT A'l TIC 1 T 1' AACGCHT ATAA A CTITTT TTATTCATTTCAG ACA 

GTAGTTGTACCG ATITTTrCTATTCG ATATTTTG ATGGTCCG ATTTTTT ATGGT A I* rTTTTI AACTATTGCTG GTTTG 

GGTGGTATATTGGGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAAATCGAATCAAA TTGTTGGTGTA TTTC 

TTTTTTTG AACGG CTC AAGTTGGTTAGT AG C AATTGTT AT AAAAG A CTAT ACTTT ATC ACTT ATTTT A r i 1 1 1 CGTTT 

GTTTT ATGTCT AAAGG AGTCTTC AAT ATT ATTTTT A ATTCGTTGT ACC A AC AAAT ACCTCC A CATC A Ac l ivi i GGT 

AGGGTAAATACTACC ATTG ATTCTATT A tTTC 111 1 GG AATGCCAATTGGTAGTTTAGTTGCAGG AACGCTTATTG A 

TTTGAATATTGAATTAGTGTTAATTGCTATTAGCATACCTTATTTTTTGl^ 

ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCrGATTTTTGCGATTCT 
ATrGCTGGCTTCTTTAATGCCATCTGCCATTGCAAA 

GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAArrCTTCT^ 
AGGATGGCTCTTTCTCATTCTGATGACTGTTGTCTTTGAATTTGTAT^ 

ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACHXjATAG 

TGTCATTGGACCTCTGGTAGAAGAATTATTTTTCCGTCAGGTCTTATTGCATTACTTGCAGGA 

TACTAAGCATTATTCTGGTAGGACTTGTTTTTGCTCTGACTCA 

GCAGTrGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTGAAAGAA^ 

TACTTGTTCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATCAGTATAGTAAA^ 



4224.1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTITTGAAGACAGCAACACCAAGGTTCTCAAAG 

ATCAACTITGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTGCATCTGGTrCGGGGAAA 

ACATTATTGCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCITGTTCCCACATATGAATGTGTTTGAAAATG 

TTTCCACTTCGCTTGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTTCTCAAGATGGTTCAGT 

TGGAAGGTTATGAAAAACGTTCCATCCGCAAACITrCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 

TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGACrTGAAATTGAGAACAGACATGCA 

GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCTTTGTCTTTGTCACTCACGATCAGGAAGAA 

GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG 

AGCCAATCAACCACITTGTTGCCACCTTTATCGGGGAGTCAAACATCT^ 

GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAG GTCGT TAT 
TCGTCCAGAGGACITGCGCATTACCCrTCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGL Tl i I CCGT 
GGAGTTCATTATGAAATTATCGCCTATGACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 
TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 
TCG ATG CTCGT ATTG AGG AGT ACGT AG A A ATCG AAG AG C AAG AAG C AGGTTTG ATC A ATGC AATCG AGG AGG AAA 
G AG ATG AAG AA AAC AAG CTCT AA 

4252.1 

ATG AAATC AATG AG AATCTT ATTTTTGTT AG CTTT AATTC AAATC AGTTTG AGT AG CTGTTTCCT ATG G AAGG AAT 
GCATCTTGTCCTTTAAACAAAGTACAGCTTTTTTCATCGGAAGCATGGTT^ 

AAATT ATCTTTATACCCGTAAGC AAGAAGTCCATAGTGTCCTAGCCAGTA AG AAGTCGGTG AAGC ITTn ' 1 ACAGT 

ATGTTACTCTTAATTAATTTGTTAGGAGCTGTTCTTGTTTTGTCAGATAACT^ 

AGAATTAGTTGACnTTTTATTGCCATCCrrcril'ri'CCTATTTGGGCTAGATTTGCT 

ATACGTGCGCGA I I'in CMr GCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAACACriC 1C 
TTAAGAAATCCAATGACCATTGTCTCACTTCTG^ 

AAATTCGGTTAAGAAGGAAGTTTCCriTTATGGTCATATTTTCCGAGATCTTGTATTGGTCA 
TCTTTTAG 

4252.2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCTGTAACT^ 

TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGGTGAAGAAACCAA 
GACTGAACCTCTAACCATTCTGTTAATGGGAGTGGACACCGGAAATGTTGAACGAACT 

AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 

TCTGACGCGCATTGAATCAGGGAATGGTCAGGCTCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 

AGAGCTTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGGTCAATATGAGAGG 

ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 

CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGAGCAACATATTGGGGGAGAAGAAGCCCT 

CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 

GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGC^ 
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AATATTGATTTGTCTGCAAAAAGTATCCCTAACTTGCTA 

AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTGTTTCTrACCAGATTGTrTCGAGAGCA CATA TGTTGGAAATGCA 

AAATCTACTCCGACGTTCTTTGGGACAAGAAGAAGTTACTCAGCTTGAAACCAATGCGGT^ 

TTTGGCAG AGC ACCTGTTGGTG ATG A AG ATAATT AA 

4256.2 

ATGAAAAAACAAGCCTATQTCATTATTGCTCTCACCTCCIT 

AATACTTGATTTTGACTGGTCTATCTTTTTGCACGATGTCGAAAAAACAGAAAAATTTGTCr 

TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCTCTAAGAAAAATGCAGGC 
10 TAATCTGAAGCGTTTATTAGCAGGGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCAAGTC 
TTATCAGGTAAACTTAACCrrTTGACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCTTGCTCAGGAAGAGGA^ 
ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 
ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTGTC 
ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTGTTGCATTTGCGACCAGTTGAACTG 
15 AGAGCTTGATAGAAGGGATTCAAATTCTTTTAAAAGAGCT^^ 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATACCC 
TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 
CAATGGGATTGGTTTCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAAATATCAAGGAGCGGGTTGA 
AGATATGGCTGGAACAGTrCAACTCTTGACAGCTCCCAAGCAAGGGCTGGCGGTTGATATCCGTATTCCCCTGTTA 

20 G ATAAG G AATG A 



ATGATTGTTTCCATTATTTCTCAAGGATTTGTCTGGGCTA 

AAACTTTCCAGATATGACGACAGAAGGTTCCTTCCCTCTTGGGGGAGCTGTTGCTGTCACTTTGATAAC CAAA 
25 GTGAACCCATTTTTAGCGACACTTGTTGCTGTAGGAGCAGGTTGTTTGGCT^ 

CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGATTTTGGTGATGACTTCTTGTCACTCAATCATGCT 

GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGCCTTTTGATTCGGAT^ 

CTCTTGACAGGTCTCATCTTTGTGAGTATTGTTATTGCTCTCATGCTCTTTTTCTTGGACACT 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC 
30 TTGGTCTT ATCAAATGGTGTG ATTG C CCTTG C AGGTG CCCTC ATTGCTC AG C AA G AAG GTT ATG CCG ATGTGTCTC 

GAGGGATCGGGGTTATCGTTGTGGGGCTTGCAAGTTTGATTATTGGAGAAGTTATTTTCAAGAGTTTGAGC T^ 
AGAGCGTTTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGGGCAGTTATCGCACn^ 
ATACAAGTTACCTTCGTTTATACAGTGCCTTGATTTTAGCAGT 
CTTG AAAGG AG CC AAGTT AAGCAAATG A 

35 

4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACTTGCAACGGGAGTTGCTCTTACTACACTTG 
GTTCAAATTCTACGACTGCTTCTTCATCTGAAGAAAAAGCTGATAAAAGTCAAGAATTAGTT 

AGTCTCAAATGGTCGTGGTGATTGGTTAACTGCTAAAGCAAAAGAAGCTGGTTTTAATATAAAAATGGTTGATATC 
40 GCTGGCGCTCAATTAGCAGACCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTATTTGGAATTGGTG 
CTGTTGATrCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCTAAATGGTTAGATAAAATTGATC 
AATCTTTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTAGTTTTAATTG 
AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAATTTCTGGTCITCAAGGA 
GGTACAGGACGGGCAATTCTAGCAAGTATCnTAGTTCGATACCTTGATGATAAAGGTGAATTAGGTGTTTCCGAAA 
45 AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGGAGAAAGTTCAATTGTTAAGA 
TGTT AG A C AAAG AAG ATCC AAT AC AATATGG AATG ATGTGGGGTTCTG GTG C ATT AGTTGG AC AAAAAG AAC AAA 
ATGTTGTTTTC AA AGTT ATG ACTCCTG AG ATTG GTGTACCATTTGT AACTG AACAAACT ATG GTTTT AAG C A CT AG 
TAAAAAACAAGCGTTAGCTAAAGAATTTATTGATTGGTTTGGTCAATCAGAAATTCAAGTAGAATATAGTAAGAA 
CTTTGG ATCT ATTCCTGCAAAT AAAG ATG CCCTC AAAG ATCT ACCTG AAG AT ACG AAG AAATTTGTTG ATC AAGTG 
50 AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATAC 
GTACAATAA 

4346.2 

ATG ATT A AATTTG ATAAT ATTC AAATT A AATATGGTG ATTTTGTTG C AATTGATAATCTG A ATTT AG AT ATACATG 
55 AAGGGGAATTTTTTACATTTCnTGGGCCrrGAGGATGTGGTAAATCAACTACTITGAG 

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTCATTTGGAACCTGAAAAGCGTGGAATTGG 
TATTGTATTTCAATCrTATGCGCTATTTCCAACTATGACTGTTTTTGATAATA 

AGGTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCGTAATGTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 
60 AATTCTTTGTCTAGATGAACCATTGTCAAACCTrGACGCAAAATTACGTGTAGATTTGAGAAAAGAGTT 
CTTCAAAAAGAGTTAGGTATTACTACTTTATATGTTACrCATGATCAAGAGGAAGCC^ 

TTGC AGTCTTT AAC AATGG AT A C ATCG AAC AG GTCG GT AC ACC AGT AG AG ATTT ATC AT AATTCTC AA ACTG AATT 
TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAGTATTATTGAAAAATA CAAGC 
GTTTTCTT AG AGG AT AAAAAAGG AT AC ATTCG ATT AG AG AAA GTTCG ATTC A ATCGTG A AACTG AAC AAG ATTTT A 
65 TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGTTTCT 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



71 



PCT/GB99/02451 



GATTCTTAATGTAACAAGTATTGATAGTCAGGCTGCT 
CCATCAGACGTTCTGCAATTTTAA 

4346 3 

5 ATGCGTCATAAATTAAATTTAAAAGATTGGCrrATTCGTTTAGGGTTAATCT^ 
TCCAAACTTTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATT 
AAATCTCAGAGGGCACITCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCAT^ 
TAGGTATTCTTTGTGTTCTATTTACAGAGTACTTTGATATTAAAGGTGCTAAAATTTTAuA 
TCTTTAATTTATGGAGGAGTGGTTTTAGCGACTGGATATAAATTTGTCTATGGTCCTTA 

10 TTTACAAAATGTT ATCC C V I C 1 1 4 ' AG ACCCT AACTGGTTTATTGGGTATGGTGC AGTCTTATTCATT ATG AC ATTTT 

CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCG 
AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTGTTACCAACCTTAAT^ 
ACTATTATGGTTTTTCTTAGTGGTTTATCTGCAGTAGCAGCACCCATGATTGTTGGTGGTAAAGAAT^ 
AAATCCAATGATTATTACATTrGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACTTGCAATT 

15 ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATTT 
AAGCGCCTCITAAAAAACAAAAAATTGCGTCTAAGC^ 
CACAGTTTTCATGCTTCCACTAATTTTTATAGTATTATACTCATTTACAGA 
TAACATTATCAAACTTTACITTAGAAAATTATCGCTTATTCrrTTAGTAATAGTGCG 
AGCITTATTTATTCTATTATTGGTGCGACAACAGCAACAATTCTCGCAGTTGTATTT 

20 TAAATCTCGTTTTGATTTCTTATTTGAATATGGTGCTCTACT 
GTTTATTATTTACTTTTAATCAGCCACAATTTCTTGTCT^ 

ATTGCATATATAGTTGTAAAAATCCCATTTTCTTATAGAATGGTACGTGCTATTTTAT^ 
GGAAGATGCAGCAAGAAGTATGGGTGCTTCACCTTTTTATACTATGATGAAGGTTATCATTCCA T^ 
GTTGTTCTCTCTGTTATTGCnTTAAACTTTAACTCrTTATT 
25 CT AGCTC AAC C ATT AGGT ATT ACG ATTCG ATCTG C AG GTG ATG AAAC AG C AAC ATCT A ATG C AC AAG CTCTGGT AT 

TTGTTTATACAATTGTTCTGATGATTATTT^ 
GAAATAA 
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Table 2 



MEELVTLDCLFIDRTKIEANAMCYSFVWKKTTE^ 

HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGRQIF 
5 AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVR1DMGLVLMANNLLKYSKMK2 

MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNlAGESVVYADETLrTHTAEKPKEEKMIVEEKADKALETKNIV 

ERTEQSEPSSTEA1ASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQN 

WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTF 

10 DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHV 
EKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVER 
PKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAE 
YRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDA 

15 VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD 
RDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENK 
ELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

20 TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSE 

VEEYEVDKWEIAEEDKAKUUPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVI^VTVGGEAVTGLTSQKPMQYRTLA 
YGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSV 
RAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVT 
DLHQEPSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAP 

25 QLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAF 
ASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIEY 
YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTSITEV 
QIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGD 
ILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVR 

30 GRVLGSNLVAEITVRNTTDKLGETLSDNPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNP 
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYR 
ADKDIAAGDEINVTFK^IKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 
YKGQRPKVSVEENNQVASTWDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAY 
KTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKADTKPID 

35 SSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKEDZ 

MKIMKKKYWTLAILFFCLFNNSVTAQEIPKNLDGNITHTQTSESFSESDEKQVDYSNKNQEEVDQNKFRIQIDKTELFVT 
TDKHLEKNCCKLELEPQINNDrVNSESNNLLGEDNLDNKIKENVSHLDNRGGNIEHDKDNLESSIVRKYEWDIDKVTGG 
GESYKLYSKSNSKVSIAILDSGVDLQNTGLLKNLSNHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN 
40 INGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDHNLSTGQYLMIDGEYEDGTNDFETFLKYKKAIDYANQKGV 
IIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSEL 
GLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMIIDKYKLKDQPYNYMFVKKFWKKHYQ2 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP 
45 PEESSSDLVINGKAPFAVYFQDYMAKKLEKGAGrTAVAArVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML 
KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVUANND 
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 
KWDKENGILKEDLTDKGFTNEFVKZ 

50 MKRTWRNSFVTNLNTPFMIGNIEIPNRTVLAPMAGVTNSAFRTIAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHIDE 
GENPVSIQLFGSDEDSLARAAEFIQEOTKTDIVDINMGCPVNKIVKNEAGAMWLKDPDKIYSIINKVQSVLDIPLTVKMR 
TGWADPSLAVENALAAEAAGVSAIJVMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGDIRTVQEAKQRIEEVGA 
DAVMIGRAAMGNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLKGENVAVREFRGLAPHYLRGTSGAAKL 

^ RGAISQASTLAEIETLLQLEKAZ 

MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLFCTKTSDEIPSLLQSYSKSLTISAHLK 
RD1VDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQNQKVSFVRGAS 
HELKTPLASLRI1LENMQHNIGDYKDHPKYIAKSINFGDQMSHLLEEVLESSKFQEWTECRETTVKPVLVD1LSRYQELAH 
60 SIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDICNGRVHSEQDGYLSIKNTCAPLSDQELEHLFDIFYHSQIVTD 
KDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

MYLGDLMEKAECGQFSII^FLLQESQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAIHSEDENLRLSIGAATK 
GRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL 
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FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR 
LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMKKEELLGDYTEDHVTYEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPYLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQ 
HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 

5 TZ 

MEFSKKTREL^IKKMQERTLDLUIGGGrTGAGVALQAAASGLETGLffiMQDFAEGTSSRSTKLVHGGLRYLKQFDVEV 
VSDTVSERAVVQQIAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPAANKVLSKDQVLERQPNLKK 

EGLVGGGVYLDFRNNDARLVIENIKRANQDGALIANHVKAEGFLFDESGK^ 
10 SDKVRNIJSNKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGIXjRMVFVLPREr^YFGTTDTDYTGDLEHPK^ 
QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLLATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKrTOYRKMAEGAMERVVDILKAEFDRSFKLINSKTYPVSGGELN 
PANVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKNZ 

15 

MMNELFGEFLGTUULLGNGVVAGVVLPKTKSNSSGWWITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNL1SEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPR1MHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

20 MTKKKIERISVIHREKILWLKWYFMRDKEQPKYSVLERKMFDAAKNQDMLAYQKYATIKQITDIRVQTSEADILEAVICE 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYIAPNSI^FEKERAVLEYl^QQASFSn-VTRFAQMARYLVL 
NDLPAKTTLDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLIELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

25 EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAEEERVVDLLHGKGVEIVIGAYASKKAYTSPFS 
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDLINL 
LRTDLYTDLSQSDIDAFEQYIRYLGiNGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQKAEKLLQKWSV 
FUCEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

30 VQSYDLIAPLTADFVYAIGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF 
LSAPSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDAR5HGNFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGH1LRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLKANGAIGVVDYKSSLTQFQFPHFFISTGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

35 EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 
AINPYTENGRSIAPYVQQHQAITGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQKKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 
40 AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRILQNQSEQLILENE 
VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLFLDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADiSHYTIE 
ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQrFNEKFQRYAQNPQEG 
45 RLULKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDELARLSLQKAEDKVHENLYEKLVNAQKMASSQKGLIHTALAEKLKQFMDILASWRLYAKT 
HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 
50 LQIJVSYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSNTRLQARNFQDWLWAISKVFTKDKLNF 
SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAIELPSVQTPSQIKKFYEPVMDM 
EGVEIAGQGQSVGKKISFDLPDFSTKEKVTGAEIGSATHELMQRIDLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 
EEALSRAYSIENIEKYLILLGKDEVQVVKVZ 

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWND1SSAAPNTDYVIYNIPQLAGVALTPSLYTEMLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAIIGKL 
TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIRETKERFLZ 

60 

MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPIIGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPItLNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVlRSLLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIKEGV 
RLGLPIGGTVFAEVA1FSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW 
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TALIFAAFTLTFLYIFRGWASLYGNDPKFIDLTVRFLTYSLFFQI^DTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP 
VYAIZ 

MSTIJUCIEALLFVAGEDGIRVRQLAELLSLPIHXjIQQ^ 
5 pINQSLSRAALETl^llAYKQPrTRIEIDAIRGVNSSGALAKLQAFDLIKEDGKK^ 
PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALKSLKRNGW^VAAVSSVMrTLTLVAlFASVlFNTAKLATDIENNVRVVVYIRKDVEDNSQTIE 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVKTIAEDA 
10 KKIEGVSEVQDGGAOTERLFKUVSFIRVWGLGIAALLIFIAVFLISNTIRTTIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIG 
LLGAIAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYUCEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYF 
YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEW 

15 LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGY 
MAANEWIWDKESWFYIJCFIX}KMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSTC 
SHSQAWYYFKSGGYNH'ANEWIWDKESWFYLKSIXiKlAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKD 
FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

20 NNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWI 
KENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

MKKVLQKYWAWAFVVIPLLLQAIFFYVPMFQGAFYSFTNWTGLTYhmCFVGLNNFKLLFMDPKFMNAIGFTAIIAIAM 
VVGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHIEFFQTSLLGTKWGAIFAAVF 
25 VLLWQGVAMPIHFLAGLQSIPTEITEAARIDGATSKQVFWNIELPYLLPSVSMVFILALKGGLTAFDQVFAMTGGGPNN 
ATTSLGLLVYNYAFKNNQFGYANA1AVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKAUGKYILULGSVLILVPLLATLFSSFKPTKDIVDNFFGFCT^ 

LAVMIFIPMAAYSIARNMSKRKAFTIMYTLLILGIFVPFQVIMIPr^VMMSKLGLA^r^FGULLYLTYAIPQTLFLYVGYlKI 
30 SIPESLDEAAEIDGANQFTTYFRIIFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YG PS FAS YVVG IISITIV YLFFQ RHIISGMS NG AVKZ 

MKSILQKMGEHPMLLLFLSYSTVISILAQNWMGLVASVGMFLFT1FFLHYQSILSHKFFRLILQFVLFGSVLSAAFASLEH 
FQIVKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMlAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFrQNRT 
35 AFPAIIAGAUYLFITIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEER1SIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTILSYGIVGTILLVLSSVAPVRLMMDMSQESGKRPHGLYLSFLTVVAVHGIFDLALFWIQS 
GFIFLLVMCS1PLEHRMLVSDMTDZ 

MSKMDVQKIIAPMMKFVNMRGIIALKDGMLAILPLTVVGSLFLIMGQLPFEGLNKSIASVFGANWTEPFMQVYSGTFAI 
40 MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV 
IKMPEQVPQA1AKQFEAMIPAFVIFLSSMIVYILAKSLTNGGTFIEMIYSAIQVPLQGLTGSLYGAIGIAFFISFLWWFGVH 
GQSVVNGVVTALLLSNLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGITFGLWAMLFAAKSKQYQALGKVA 
AFPAlFWNEPVVFGFPrVMNPVMFVPFILVPVUVAVIVYGAIATGFMQPFSGVTLPWSTPAILSGFLVGGWQGVITQLVI 
LAMSTLVYFPFFKVQDRLAYQNEIKQSZ 

45 

MKKKDLVDQLVSEIETGKVRTLGIYGHGASGKSTFA'QELYQALDSTTVNLLETDPYITSGRHLVVPKDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALU^I^LAACSQEKTKNEDGETiaEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVAN 
50 KHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKE1AASGLSLEEYYGF 
EGGDYVDZ 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 
55 LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL 
HFQKKDFPSIFHFEDTSNQENHYFRNRJRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 
TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHP1KNGYELIKEYQQFQICKISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVLIKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 
SKKTKNDIMNTVLYIEKIDRZ 

60 

MRKFLIILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPKTSFQIT 
EWRLNKQGIPVFKLSNHQFIAADKRFLYDQSEVTPTIKKVWLESDFKLYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHIDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSrYVKQLTTGKEAGINQDEKMYAASVLKLSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPKKEDNKEYSLKDLITKVSKESDNVAHNLLGYYISNQSDATFKSKMSA 
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IMGDDWDPKEKLISSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRIAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL 
SIFTKNSDYDTISKIAKDVYEVLKZ 

MKKQNNGLIKNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQINYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 
5 EETGIQFFTPSVTKVEKFTSTILPADTTVSELQKLATDHKAEVTVKHESSSGIWINLLVSIVPFGILFFFLFSMMGNMGGG 
NGRNPMSFGRSKAKAANKEDIKVRFSDVAGAEEEKQELVEVVEFLJCDPKRFr^ 

VAGEAGVPFFSISGSDFVEMFVGVGASRVRSLFEDAKKAAPAHFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 
FEGNEGIIVIAATNRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 
VLNEAALVAARRNKSIIDASDIDEAEDRVIAGPSKKDKTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 
10 GGYMIALPKEDQMLLSKEDMKEQLAGLMGGRVAEEHFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG 
NHAMLGAQSPQKSISEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPEAVE 
EES HALS YDEVKSKMNDEKZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTT1SQTSHFIQSY 

15 SQDKVEGIRDLFLTILKSDKDLXTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 
VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 

20 KEKEGQGHIKLSVQKQDSGLVIR1EDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 
EIYINRIETSZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
SQDKVEGIRDLFLTILKSDKDLKTVVLNn-KSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 
25 VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEK1AGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 
KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 

30 EIYINRIETSZ 

MFFKLLREALKVKQVRSKILFTIFrVLVFRIGTSITVPGVNANSLNAl^GI^FLNMLSLVSGNALKNFSIFALGVSPYrrASI 
VVQLLQMDILPKFVEWGKQGEVGRRKLNQATRYIALVLAFVQSIGITAGFNTLAGAQLIKTALTPQVFLTIGIILTAGSMI 
VTWLGEQrTDKGYGNGVSMIlFAGIVSSIPEMIQGIYVDYFVNVPSSRJnrSSIIFVnLirrVLLIIYFTTYVQQAEYKIPIQ^^ 
35 VAQGAPSSSYLPLKVNPAGVIPVIFASSrTAAPAAILQFLSATGHDWAWVRVAQEMLATTSPTGIAMYALUILFTFFYTF 
VQINPEfCAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVTETIAMIEEQNFDIRTrTMGISLLDClDPDINRi^EKIYQKITTKAANLVAVGDEIAAELGrPIVNKRV 
AATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDKVCSSVNIGSTKSGINMTAVAD 
40 MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVHNVGVSGPGVVKRALEKVRGQSFDWAETVKK 
TAFKn'RJGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNIXJVKKGGVMAC 
NQ VGG LS G A FIP VS ED EG M IAA VQ NG S LN LEKLE AMT AICS VG LD M IAIPEDTP A ETIAA MIA D E AAIG VINMKTT A V R1I 
PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ 

45 MTQIIDGKALAAKLCK3QLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRN1CERSALAAGFRSEVVRVPETITQEELL 
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYGIDLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAKADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GKLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

50 MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY 
ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYSGTAVIYDFGPRQMETGELII 
YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 
DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 
RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFL 

55 DKLVZ 

MFISISAGIVTFLLTLVEIPAFIQFYRKAQITGQQMHEDVKQHQAKAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTIXJVDGLASISVVISLSAYGVIAYVQGQMDILLVILAMIGGLLGFFIFNHKPAKVFMGDVGSLALGGMLAAISMA 
60 LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKRIFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 
LAILYLMZ 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGL1AISGVSVAGNIITIYQAIFIALGAAISSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGAHRATHNPRLPLYVSFL 
65 S N ALNILFSSL AIF VLDMGIAG V A WGTI VSRL VGL V IL WSQLKLP YG KPTFG LDKELLTLALP A AGERLMM RAG D VVIIA 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



PCT/GB99/02451 



76 



LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKC^FWLSLFU^LPLSFSnrVLGVP 
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLMGIVLGWGLPGIW 
AGSLLDNGFRWLFLRYRYQRYMSLKGZ 

MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFGIGLPVVLGVA 

FQSVAPLIMIGQSHGSGAMFGALIASGIYVVLVSGIFSKVANLFPSrsn*GSVrrTIGLTLIPVArGNMGNNVPEPTGQSLL^ 

ArrVLnLLINIFrKGFIKSISIUGLVVGTAlAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSIVMMCIIATVSMVEST 

GVYIJ^LSDrrKDPIDSTRLRNGYRAEGLAVLLGGlFNTFPYTGFSQNVGLVKLSGIKKRLPfYYAAGFLVLLGLLPKFGA 

LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNGIVVASL 

LAIVLNAVLNHKKKZ 



MKDRIKEYLQDKGKVTVNDI^QALGKDSSKDFRELIKTI^LMERKHQIRFEEDGSLTLEKKKHEITLKGIFHAHKNGFG 

FVSLEGEEDDLFVGKNDVNYAIEX5DTVEVVIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

QKISQPIYVKKPAUaEGTEVLKVFtDKYPSKKHDFFVASVLDWGHSTDVGIDVLEVLESMDIVSEFPEAVVKEAESVP 

DAPSQKDMEGRLDLRDEFTFTIDGADAKDLDDAVHIKALKNGNLEFGVHIADVSYYVTEGSALDKEALNRATSVYVTD 

RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDKHGRVVNYTrTQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 

LMAKLHETLE^RVKRGALNFDTNEAKILVDKQGKPVD^LRQRGIAERMIESFMLMANETVAEHFSKLDLPnYRIHE 

EPKAEKVQKFIDYASSFGLRIYGTASEISQEALQDIMRAVEGEPYADVLSMMLLRSMQQARYSEHNHGHYGLAADYYT 

HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV 

KFGLFVELPNTVEGLIHrTNLPEFYHFNERDLTLRGEKSGITFRVGQQnURVERADKMTGEIDFSFVPSEFDVIEKGLKQS 

SRSGRGRDSNRR5DKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWILWFA 

QLGMVVAIPLTIRHLPIFARIDIDTAYDYLDKRFNSKALR1ISALLFHYQLGRMSI1MYLPSAGLSVLTGIDINILIILMGVV 

AIVYSYTGGLKSVLWTDFIQGVILISGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 

TII^SYASSQDLVQRFITTQNIKKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 

PVGrTGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSrVMAHSDIKSAYEWF 

NSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVS 

APK YTTIHDITEIKADS S WE V RHZ 

MKFSKKYIAAGSAVrVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVlKITDQGYV 

TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQK 

QEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADHEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 

QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF 

IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSU3SLSSNPSSLTTSKELSSASDGYIFNPKDrVEETATAYIVRHGD 

HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 

VRKNIZ 

MKKRAIVAVIVLLLIGLDQLVKSYIVQQIPLGEVR5WIPNFVSLTYLQNRGAAFSILQIX2QLLFAVITLVVVIGArWYLHK 
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFNVADSYLTVGVIILLXAMLKEEINGNZ 

M NTNLAS FIVGLIIDENDRF YFVQKDGQT Y ALAKEEGQ HTVG DTVKGF A YTDMKQKLRLTTLEVTATQDQFG WGR VT 
EVRKDLGVFVDTGLPDKEIVVSLDILPELKELWPKKGDQLYIRLEVDKKDRJWGLLAYQEDFQRLARPAYNNMQNQN 
WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMILTYLE 
SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFGTELIZ 

MKDVSLFLLKICVFKSRLNWFVLALFVSVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKL^QMSDT 

AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 

TLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV 

GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 

LQRIAHLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRFZ 

MMKFILDIVSTPAILVALIAILGLVLQKKKLPDIIKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 

VAVALTTYGSATAMIMFAGMVFNILIARFTRFKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGHMSISPAF 

VQKYMVQLTGNDKVALGHFSSLGYWI^GFTGSLIGDKSKSTEDIKFPKSLAFLRDSTVSrTLSMAVIYIIVAIFAGSEYIEK 

EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVLIGFISSFVGGLVS 

MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGH 

LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKKPSATEEZ 

MIKTFl^AL^VILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATATMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLIL 

LLFGTDKTL^ASNKTKTLKLNTTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQRIKLLFHQVGLSMANYD 

IFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDIlALHTAPPLPGLMEiWKQDLNIIHNQLASKYP 

KAIIAGDFNATMRHGALAKISSHRDAUNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDIVSFQNSDHRCIFT 

EITFZ 
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mnpiqrswayvsrkrlrsfilflillvll^gisacltlmk 

envspeletvak1jcdkeavtgeqsverddlsaadnnlvsltaledsskdvtftssafnlkegrhlqkgdskkiliheel 

akknglslhdkigldagqsesgkgqtvefeugifsgkkqekftglssdfsenqvftdyessqtllgnseaqvsaarfyve 

npkemdglmkqven1^len(^yqvekenkafeqikdsvatfqtfltiflygmliagagalilvlslwlrervyevgil 

laixskgkssiflqfclevvlvslxjallpafvagnarttyllqtl^ 

clsvalcflflfrkspkeilssisz 

mlhnafayvtrkffksivwliillmaslsl^^ 

tenkaiesyvkrinaigdltgyduetpetkknltadraki^gsslmrrgvndsskedkfvsgsyklvegehltnddkdk 
illhkdijuvkhgwkvgdkvkldsniydadnekgaketve^ 

tedtaiygdatffvtadknlddvmkelngisg1nwksytlvksssnypaleqsisgmykmanllfwgslsfsvlllall 

i^lwinarrkevgili^iglkqasiu}qfitesiliaipalvs^^ 

gaevdgfsktlssld1siqtsdfiiifvlalvlvvlvmalassnllrkqpkellldgez 

msqdkqmkavspllqrvinissivggvgslifciwayqagilqsketlsafiqqagiwgpplfiflqilqtvvpiipgaltsv 

agvfiyghiigtiynyigrvigcaiifylvrlygaafvqsvvskrtydkyidwldkgnrfdrffifmmiwpispadflcmla 

altkmsfkrymtiiiltkpftlvvytygltyiidffwqmlz 

mrnmwvvncetylrhveswsfffmvlspflflgisvgighlqgssmaknn^ 

asakeaikeeklkgyltidqedsvlkavyhgetslengikfevtgtlnelqnqlnrstaslsqeqekrlaqtiqftekide 

ake^khqtiaagalgfflymilityagvtaqevasekgtkimevvfssirashyfyarmmalflvilthigiyvvggl 

aavllfkdlpfl^qsgildhlgdaislotllhlislfmyvvl^ 

gdnlllkigsyipfistffmpfrtindyaggaeawislaitvifavvatgfigrmyaslvlqtddlgiwktfkralsykz 

mtetiklmkahtsvrrfkeqeipqvdlneiltaaqmasswknfqsysvrvvrsqeickdalyelvpqeairqsavfll^ 

gdlnraekgarlhtdtfqpqgvegllissvdaalagqnallaaeslgyggvnglvrykseevaelfnlpdytysvfg 

malgvpnqhhdmkprlplenvvfeeeyqeqsteaiqaydrvqadyagarattswsqrlaeqfgqaepsstrknleqk 

kli^mlkliaivgtnskrstnrqllqymqkhftdkaeielveikaipvfnkpadkqvpaeileiaakieeadgviigtpeyd 

hsipavlmsalawi^ygiypllnkpimitgasygtlgssraqlqlrqilnapeikanvlpdefllshslqafnpsgdlvdl 

d vuckld aifddfri fvkiteklrn aqellrkd a edfd w en lz 

mntyqlnngveipvlgfgtfkakdgeeayravlealkagyrhidtaaiyqneesvgqaikdsgvpreemfvttklwns 
qqtyeqtrqaleksieklgldyldlylihwpnpkplrendawktrnaevwrameplyqegkiraigvsnflphhldal 
letativpavnqvrlapgvyqdqvvaycrekgilleawgpfgqgelfdskqvqe1aanhgksvaqialawslaegflp 
lpksvttsriqanldcfgielsheeretlktiavqsgaprvddvdfz 

mrckmldpiaiqlgplairwyalcivtgulavyltmkeaprkkiipddildfilvafplailgarlyyvifrfdyysqnlg 
eifaiwngglaiygglitgalvlyifadrklintwdfldlaapsvmiaqslgrwgnffnqeaygatvdnldylpgfirdq 
myiegsyrqptflyeslwnllgfalilifrrkwkslrrghitafyliwygfgrmviegmrtdslmffgfrvsqwlsvvli 
glgimiviyqnrkkapyyiteeenz 

mgklssillgtvsgaal^lfltsdkgkqvcsqaqdflddlredpeyakeqvcekltevkeqatdfvlktkeqvesgeit 
vdsilaqtksyafqateasknqlnnlkeqwqekaealddseeividiteez 

mktklifwgsmlfllslsilltiylawifypmeiqwlnltnrvylkpetiqynfhilmnyltnpfsqvlqmpdfrssaag 
lhhfavvknlfhlvqlvalvtlpsfyvfvnrivkkdflslyrksllalvvlpvmiglggvligfdqfftlfhqilfvgd 
dtwlfdpakdpvimilpetfflhafllffalyenffgylylksrrkz 

mtyhfteeydiivigaghagveaslaasrmgckvllatiniemlafmpcnpsiggsakgivvrevdalggemaktidkt 

yiqmkmlotgkgpavralraqadkelyskemrktvenqenltlrqtmideilvedgkvvgvrtathqeyaakavivt 

tgtalrgeiiigdlkyssgpnhslasinladnlkelgleigrfktgtpprvkassinydvteiqpgdevpnhfsytsrdedy 

vkdqvpcwltytngtshehqnnlhrapmftgvvkgvgprycpsiedkivrfadkerhqlflepegrnteevyvqglst 

slpedvqrdlvhsikglenaemmrtgyaieydmvlphqlratletkkisglftagqtngtsgyeeaagqgiiaginaal 

kiqgkpeliucrsdgyigvmiddlvtkgtiepyrlltsraeyrlilrhdnadmrltemgreiglvdderwarfeikknqf 

dnemkrldsiklkpvketnakveemgfkpltdavtakeflrrpevsyqdvvafigpaaedlddkhelieteikyegyisk 

amdqvakmkrmeekripanidwddidsiatearqkfklinpet1gqasrisgvnpadisilmvylegknrsisktlqkskz 

mtkqvllvddeehilklldyhlskegfstqlvtngrkalalaetepfdfilldtmlpqldgmevckrlrakgvktpim 
mvsaksdefdkvlalelgaddyltkpfsprellarvkavlrrtkgeqegddsdniaddswlfgtlkvyperhevyka 
nkllsltpkefesdknpffevfkvskvtaqz 

mttfkdgflwggavaahqleggwqeggkgisvadvmtagrhgvareitlgvlegkyypnheaidfyhrykedialf 

aemgfkcfrtsiawtrifpkgdelepneeglqfydnlfdeclkngiepvitlshfempyhlvteyggwknrklidffarf 

aevvfkrykdkvkywmtfneinnqanyqedfapftnsgivyeegdnreaimyqaahyelvasaravkigheinpdfqi 
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GCMIAMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVL^^ 
YYMSFATOSHRENNPYFDYLETEDLVKNNYVKASEWEWQ^ 

MVHDDYWEYLGAHIREMKKAVVEDGVDLMGYTPWGCIDLVSAGTGEMRKRYGFIYVDKDDNGKGSYNRSPKKSKi 
WYKEVISSNGESVEZ 



MDQQNGLFGFLENHVMGPMGKUUJFKWRArTAAGMAAVPCT^ 
MVANYATMGSI^LYFVLSL^YELTKIYAEEEELNMNPLNGALLALMAFVMTVI^ 
WAMG^VARFGTTGIFTAIIMAIVTVLIYRMCVKHNWVIK 
10 DIFKVIAIPFGFVSm.TNSWIGLMUYLLTQLLWIVGlHGANIVFAFVSPlALANMAENAAGGHFAVAGEFSNMFVlAGGS 
GATLGLCLYIAFASKSEQLKAIGRASVVPALFmNEPUFGLPIIYNPALAIPFIl^PMVTATIYYVANSU^FIKPIIAQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLL\FGVI^TFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
15 MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 
EQArTDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASIKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LUVKDAMGMVTNRSLLILVDHSKTALTI^KEFYDLFTQTIVIDHHRJUXJDFPDNAVITYIESGASSASELVTELIQFQNS 
20 KXNRLSRMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDVLI 
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGFISISARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVTLS 
EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGL^IVVIJVLFIMGARKTKLASFNFSFF 
25 RAKDLARLGl^YLVIVGSNILGSILLQlJSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKiCiFRGKENL 
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGI^IVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGLSYLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
30 GFVVGTrVFALLHQPSNLPSLLIYGGMSTVl^WTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MDTQKlEAAVKMirEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZMKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH 
35 FPENHWPILLFVG1TSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY 
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKGISNSVKRRAYLDFILKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

40 MRKSlVLAADNAYLIPLETTIKSVLYHNRDVDFYILNSDIAPEWFKLlXj 

SYFRFFATEVVESDRVLYLDSDIIVTGELATLFEIDLKGYSIGAVDDVYAYEGRKSGFhTTGMLLMDVAKWKEHSIVNSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERLDDNPPTIVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSELAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDINTGGEVFNVVTRAQESGKKIFAFDITRKSMDDGLYDGIFSVERPDDLVDRMKNI 

45 EIEZ 

MTKIYSSIAVKKGLFTSFLLnYVLGSRIILPFVDLNTKDFLGGSTAYLAFSAALTGGNLRSLSIFSVGLSPWMSAMILWQ 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLIAVIQSLAVSLRLPVQSSYSAILVVLMNTILLIAGTFFLVWLSDLNASMGIGGSI 
VILLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP 
50 YMYVMSFI^VPAYLFILLGFIFPNHSGLAAl^KEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEEIADRMKKSGEYIYGI 
YPGADTSRFINRLVLRFSVIGGLFNVIMAGGPMLFVLFDEKLLRLAMIPGLFMMFGGMIFTIRDEVKALRLNETYRPLIZ 

MSSl^DQELVAKTVEFRQRLSEGESLDDILVEAFAVVREADKRILGMFPYDVQVMGAW^ 

ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGFD 
55 YLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG 
AKSAENFLGIDNLYKBEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEHV 
KLSPETRAMASrrYQSLFKMFNKISGMTGTGKVAEKEHETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYlKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAEL 
GGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIICKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 
60 VEKAQHASDSAGRSARRQTLEYAESMNlQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFrsn*N 
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLY€QFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFlVPIErTPNSANTElAPPDGIGQVLSNLLLKLVDNPVNALLTANYIRILSWAVIFGlAMREASKNS 
65 QELLKTIADVTSKIVEWIINLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW 
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NCXRVSGVTAFFTRSSATNffVNMKLCHDLGLNPOT^ 

VAAISSCDASGIAGGSLLLIPVACSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATWOCZ 



MSISQRTTKLIlJVTCLACLIJVYFLNIJSSAVSAGnALl^l^DTRRSTLKLARNRLFSM 
LAFWPUVYKMGWEIGrTPSTVLVSHLLVQESTSPDLLVNE^ 

LQRPKYYl^RGDGRNRAQLVAELDTLLKEALRLVYLDHSDHLFHQTDYHrHYFEMRQRQSRILRNMAQQlNTCHLAAS 
ESLILAQLFSKIAGQLSQTNPASDLLDEIERYLEVFRNRSLPKTREEFETRATLLQLLREAKTFIQVKVDFYQKYRQZ 

MEIMSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAK 

EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQKEAE 

LERIGALSQAEARDIIIJVQTEENLTREIASRIREAEQEVKERSDKMAKDELVQAMQRIAGEYVAESTNSTVHLPDDTMKG 

RIIGREGRNIRTFESLTGVDVIIDDTPEVVTLSGFDPIRREIARMTMEMLLKDGRIHPARIEELVEKNRQEIDNKIREYGEA 

AAYEIGAPNLHPDLMKIMGRLQFRTSYGQNVLRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSH^ 

IGMELARKYKEPPVVVNTIASHHGDVEAESVIAVIVAAADALSAARPGARSESLESYIKRLHDLEEIANGFEGVQTSFAL 

QAGREIRIMVNPGKDCDDKVTILAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESG^^VTIHPDPEGKREAVRRRIEEEKR 
RKEEEEKKDCEQIAKEKEDGEKIZ 

MSAYQLPTVWQDEASNQGAFTGLNRFTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFKEAAYDLYKI 
AIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADLAIWSWYGQLVQGNLYQGSAKFLDASS 
YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ 

UVSLITSUMFYVGFDVLRDTIQKIl^REETVIDPLGATLGHS^ 

GTAIAILASSFNYPIVDKLVAIIITFFILKTAYDIFIESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNIYLDIT 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVY1RQ 
DGEQMDKEAYKTKKELNSA1KDIQITSISQKTKLICYELDGIIHTSIWRRHETWQNIFHQETKKEZ 

KTTIKLVATDMDGTFLDGNGRFDMDRUCSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDHFIAENGSLVEYQGQDL 
YEATMSRDFYLATFEKLKTSPYVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDITDDIFKFTTNFTEE 
TLEDGEAWVNENVPGVKANnTGFESlDIVLDYVDKGVArVELVKKLGrrMIX?VMAFGDNLNDLHMMQVVGHPVAP 
NARPEILELAKTVIGHHKERSVIAYMEGLZ 

MADIKUALDLDGTLLTTDKRLTDRTKETLQAARDRGIK^ 

NTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 

QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIE5QSQVMACGDEANDLSMIEWAGLGVAM 

QNAVPEVKAAANVVTPMTNDEEAVAWAIEEYVLKENZ 

MESLLILLLIANLAGLFLIWQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVLSGTKTRGILGELQLGQUEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYIAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQUVA 
GPSTLSALLNSLSVGFKTLNIQKSADHISKTLASVKTEFGKFGG1LVKAQKHLQHASGNIDELLNRRTIAIERTLRHIELSE 
GEPALD LLHFQENEEE YEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDFCEFYSYPAAKTNHHAFET 
GI^YHTATMVRUVDAlSEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSEmCTVMELGID 
KEEVVLLRHVII^HHGLLEYGSPVRPRIMEAEHHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEKAKKGFKMPSSYTVLLIIIAIMAVLTWFIPAGAFIEGIYETQPQNPQGIWDVLMAPIRAMLGTHPEEGSLIKETSAAID 
VAFFILMVGGFLGIVNKTGALDVG1ASIVKKYKGREKMULVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
LTGVAIILLGSQIGCLASTLNPFATGIASATAGVGTGIXHVLRLIFWVTL^ 

LKHFNVEESSSVESTLSSKQKSVLFLFVLTFILMVLSFIPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 

AMLFAFMGIUGVIYGLKEDKUSSFMNGAADLl^VALIVAIARGIQVIMNDGMITDTILhfWGKEGLSGl^QVFIVVTYIF 

YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSLIITAYQSASGVLNLIAPTSGIVMGALALGRINIGTWWKFMGKLVVA 

IIVVTIALLLLGTFLPFLZ 

MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAIL 

VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQUGWGLGGLLFATIGLLPTT 

CINLVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 

WGYSNTAYSIGIIISGLIAFRLSEKFLAAKWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGISVEKNTK 

RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL 
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TGYGTDYACKELSADAYFPiaLEGGQl^SQPTI^RFLSRTO^ 

GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSHTPVLERFNQLLFRMDSGFATPKLYDLIE 
CTGQYYLKLKKNTVLSRLGDI^LrcPQDEDLTILP^ 

GTSQDQFQLYRGRGQAENFKEMKEGFFGDKTDSSTLIKNEVRMMMSCIAYNLYLFLKHI^GGDFQTLTIKRFRHLFL 
5 HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMHZ 

MMEFFQQLPHLEPYGNPQYFVYVIAATLPIFIGLFFKKRFAWYEVLVSLFnVTMLVGGKTNQLAALGIYLCWEILLLLF 
YKHYRXSKDGKWVFYLVSF^LLPIIFVKVQPAINGTQSLLGFLGlSYLTFRSVGIVIELRDGVnCDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPIJCNLALQSGGFFNLYALAVMYT 
10 FGLELFFDFAGYSMFAUUSNLMGIRSPINFNKPFLSRDLKEFWNRWHMSI^FWFRDFVFMRMVMVLTRKKVFKNRN 
VTSSVLAYIVNMLIMGFWHGVTWYYIAYGLFHGIXjLVINDA 
VVMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLIAGLLVFLLIFFYPTEMHHNLGAEKRSAVATTIDSFKERSQKVRAl^DPNVRFVPFFGSSEWLRFD 
15 GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTS 
FLKHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPIXJFSYQAIEDVVKADAEKNTTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYUCSPEYNDLQLVLTQFSK 
SKVNPiniPPVNKKWMNYAGLREDMYQQTVQKIRYQLESQGFTNlADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

20 

MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KNfTQAEYKAYYTKGYHTDVTKINITDOTMEFVQGGQSKKYTYKYVGKKILTYKKGNRGVRFLFEATDADAGQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

25 MKDGHLUVHHIRLLNGRIFQKLLSQDPEALYRGEQGKILAVLWNSETGCATATDlALATGLANhTTLTTMIKKLEEQKL 
VIVSPCGKDKRiCKYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQERlLANLKEKGNEVZ 

MTNLIATFQDRFSDWLTAl^QHLQI^LLTLLLAILLAIPL^^ 

PALTALVIYAIFPILQNTITGLKGIDPNLQEAGIAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVLIIGTATLAALIGAGGL 
30 GSFILLGIDRNNASLILIGALSSAVLA1AFNFLLKVMEKAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGKIGPEP 
EILANMYKLLIEEhn-SMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQD 
HLAYLKPMSYQOTYAVAVPKKIAQEYGtKTISDLKKVEGQLKAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRY 
QAIQSGDIQITDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVG 
VEGKSAKQVAKEFLQEQGLLKKZ 

35 

MMHTYLQKKIENIKTTLGEMSGGYRRMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRI 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFKKEKQEFV 
ARKLDGKGWTKYI^ISVLWWLAFGFIYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSIYLWWGES 
LQIQGKYLIYLINSLVGWYQWSKAAKQNTDLLNZ 

40 

MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVILENNnTCILHPQPVNDEGILWLGIIArriNLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAI 
VLRFTDWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 
HVCLKEMEHMETCKESIRIFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 

45 

MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKRIKDYDERELRI^TG 
YVLQAIALFPNLTVAENIALIPEMKGWSKEEITKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQDGEIRQVANPETILKAPATDFVADLFG 

GSVHDZ 

50 

MSAVAISAMTXVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTKPQHIFFTSGGTEGNNTTnGYCLRHQEQGKH 
IITTAIEHHAVLETIDYLVQHFGFEATIIQPENQEITAQQIQKALRDDTILVSTMFVNNETGNLLPIAEIGQILKQHPAAYH 
VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAIVGMVAALKEDL 
EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 
55 AMYGANSERLKESLRISLSPQNTVEDLQTLAKTLKEIIGGZ 

MLFKLSKEKIELGl^RLSPARRIFl^FALVILLGSLLl^LPFVQVESSRATYFDHLFTAVSAVCVTGl^TLPVAHTYNIW^ 
QnCLLLIQIGGLGLMTFIGVFYIQSKQKLSLRSRATIQDSFSYGETRSLRKFVYSIFLTTFLVESLGAILLSFRLIPQLGWGR 
60 GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 
GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFST1DYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 
nTFFVLLVFARSELLGLPHANVARRTIAPRTVQKSFSVFIIFLMSFLIGLILLGlTAKGNPPFIHLVFETISALSTVGVTANL 
TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 

65 
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MSDRTIGILGLGIFGSSVLAALAKQDMNIIAIDDHAERINQFEPVLARGVIGDITDEELLRSAGIDTCDTVVVATGENLESS 
VLAVMHCKSLGVPTVIAKVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKWSIVEMKIPQSWAG 
QSLSKLDLRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTLVALNSZ 

5 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILHNDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEAS 

GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 

GQYTMMHSLIYRTDLLRASQFZ 

10 

MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKE^ 

TKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNGNTP/GKQGQALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDS 
15 IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV 
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKIXjKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALD 
GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKTVSMFQQP 
TZ 

20 

MKLKSYILVGY^STLLTILVVFWAVQKML^AXGEIYFLLG^^IVASLVGAGISLFLLLPVF^SLGKLKEHAKRVAAKDFP 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMIAQLSHDIKTPITSIQATVEGILDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLNTARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAKLSRILVN 
25 LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG 
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQIKLVYQESID 
30 LELLVNALNHHLLIDRLVLTPNQITIEIDRQIVHGLDLLKGRKDKEIIDIKSMFRQLELASTQQICPINQRVHHGILAFGEIS 
DLVPAKNLPNRQDZ 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASALILQLLPGTQGLYGFVIGILIW 
35 LQLTPELPLEKGVAYFFVALPIAIVGYFSAKHQGNVAVAGMQILAKRPKEFMKGAILAAMVETYAILAFVVSFILTLRVZ 

MLKSEKQSRYQMLNEEI^FLLEGETNVLANl^NASALIKSRFP^r^VFAGFYLFDGKELVLGPFQGGVSCIRlALGKGVC 
GEAAHFQETVIVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 
40 FTMFEEKSZ 

MSVLEIKDLHVEIEGKEIUCGVNLTLICTGEIAAJMGPNGTGKSTI^AAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
45 QLU4LEPTFALLDEIDSGLDIDALKVVSKGVNAMRGEGFGAMIITHYQRLLNYITPDVVHVMMEGRWLSGGPELAAR 
LEREGYAKLAEELGYDYKEELZ 

MPYKRQRSFSMALSKLDSLYMAVVADHSKNPHHQGKLEDAEQISLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCTIS 
50 TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFLSGVAKFPQRIKCATLAWNALKKTIENQEKQZ 

MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGIX5IAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEiCTEEL 

55 VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM 
DRFDGKPLINRPVADGIRKTEEUNLAl^GDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYS(AEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLXjTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

60 KDKFTKEERNSGLTNHMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVS 
VLVGAIVSGVVYGYLRKPQAZ 

MANKNTSTTRRRPSKAELERKEAIQRMLISLGIAILLIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLLIYLFFFKWIRK 
65 QEGLLSGFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYFIGS 
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ILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEICA 

ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVRE 
NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNL^DDl^LALAAXDVW 

SFRELWEQSQTKAENFLEIPLGKAVNGTARAFDI^KMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMMVDPK 
5 MVELSVYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDE 
1JVDLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 
DMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 
SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQZ 

10 

MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 
FFLLm*FFHEAFWVTLVWPICTMLTIMTNVAMNNKAGVIGGVAAMLirri^[PSGETILYVFVRVLETFMGVFVAnVN 
YDIDRIRLFLEKKEKZ 

15 

M^^SEHRHQLIRALr^K^4KIHTQAELQALLAENDIQVTQATl^RDIKNMNLSKVREEDSAYYVLNNGSISICWEKRLELY 
MEDALVWMRPVQHQVLLKTLPGLAQSFGSHDTLSFPDAIATLCGNDVCLnCEDADTAQKCFEELKKFAPPFFFEEZ 



20 MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT 
FSSLFYLCLACnLTTAVYILAYPLFFTDNPIVKKVYLVMGIQLLAQIFSIEWVNEALENYSFLFYKTAFIRJLMLVSIFLFVK 
NEHDIVVYTLVMSl^TLINYLISYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 
YTIAQRIVTVLAGVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPLSFGLMVLGPNAILLYGSEKYIGGGIL 
TSLFAFRTnLALDTILGSQILFTNGYEKRITVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVFYIIFIHRKQL 

25 IHUiHIFSYTVRYSLFSI^FVAlYFLINFVYPVDMVINLPFLINTGLIVLl^AISYISLLVFTKDSIFYEFLNHVLALKNKFKK 
SZ 



MELFMKITNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
30 DDCYPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGKTIAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVIVAEAKMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNnKITDSTTKUDTRNTKTYESGIFlDIFPIDRFDDPKVIDTCYKLESFKLLSFSKHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLEDZ 



MIKINHLTITQNKDLRDLVSDLTMTIQDGEKVAIIGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDL 

KKKTLHDYFFLDSIDLDYSILYRI^EELHFDSNRFASDQEIGNl^GGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 

LKGQIQKTRQTVIFISHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 

MEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 

KENLSIDDRVLVQKLQLTVRGQEK1GIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 

KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 

HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLIJ^UCASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



MSIKLJAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATGRPIAGVAKLLDDLQLRDEGDYVVTFNGALVQETATG 
HEIISESLTYEDYLDMEFLSRKLGVHMHAITKDGIYTANRN1GKYTVHESTLVSMPIFYRTPEEMAGKEIVKCMFIDEPEI 
LDAAIEKIPAEFYERYSINKSAPFYLELLKKNVDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PEIKKIAKYITKTNDESGVAHA1RTWVLZ 



MTWIILGVIALIVIFVIVSYNGLVKNRMQTKEAWSQIDVQLKRRNDLLPNLIETVKGYAKYEGSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVKLQEELTNTENKISYSRQLYNSVVSNYNVKLETFPSNIIAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ 
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MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLFIR^ 

LYHVVEDMALVAQIPMPRVFIIDDPALNAFATGSNPQNAAVAATSGLLAIMNREELEAVMGHEVSHIRNYDIRISTIAV 

ALASAITMI^SMAGRMMWGGAGRRRSDDDRIXJNGLEIIMLVVSLUUVLAPLAATLVQLAISRQREFL 

RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQKLFYTHPP1SERIERLKQMZ 



MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIMTEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 



MKRQLALVVFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLGQITAQPDFATI 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNRZ 



MTETVEDKVSHSITGLDILXGrVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 

SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYA 

KRRKRSVDSIEQLIJ^SIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYT 

VTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVI^YhrrSTMTTQGSGYTWGNG 

AQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASASAS 

TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 

SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 

SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA 

STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 

SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 

PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 

QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ 

APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 

QPVHQSQQVPVRQFRINKCLGFSKYZ 



MGVETWFYSSICWI^IGLGSVWKFPYMTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKNNKYN 

FIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQUjGTGDYAQLFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERA 

SKVMMPLLFIVFVFHGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYLDKXTNLVQSG 

ISIVAMNISISIMAGLAIFQARSPFNIQSEGGPSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 

NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ 



MUCKWQLKDVILLAFI^IFTGGVFVGSGYVYNII^LLLTPLGLQAFANEILFGLWCMAAPLAAIFVPRVGSA 

LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRFISV 

YFFCTILVRAIVKLYHQFATGGKAZ 



MVKVATQTPIISLFLLILSLETSnPSIALTLSVVAFCILFMLYYRRFKMLAWMIILAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYALIVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKLIYLLFFLLLQTSLFLZ 



MRKHQLQVHKLTII^MMIALDVVLTPIFRIEGMAPMSSVVNILAGIMMGPVYALAMATVTAFIRMTTQGIPPLALTGAT 
FGALl^GLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
KVQGYFFSERIDZ 



MQEFTNPFPIGSSSUHCITNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWITGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNIAAEKIVVQGMG 
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 



MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPJTASVRGNWDDRVLEALDGQ 
YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKKEIEXiLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET 
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DVAVYGHVHKQLLRYGSQGQQIINPGSIGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG 
LPFIEMYEELRRDDNYQGHNLELLASUEKHGYVEDVKNFFDFLZ 

5 MNVNQIVWIPTLKANNRKLNETFYIETLGMKALL^ 

NPLEIEGILSKTDSIHRLYKGQNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLVNELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEVZ 

10 MKWTKinCKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWElGQLD 
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRKSQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKDQVWKPWGMTETKFGPVELAVPTVRGVEAGIVHDPKARLLGRHAGSAGLFSTIKDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 

15 

MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVFEIDDLWTNTLGGYLAWLLYKRLHKNKVRN 
Z 

20 

MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIKNDLLKSALDFR1LQFNLGFYQSFHPIII 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLIIAIITYFFGTFSPLGWNSLFSDGSGLQRLLDGE 
IKSYLFFTCVLLIGIFlNAIYFLQIVDYVGNVTRSArTYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYIVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPIAILPAAGLLLGIGGAI^NPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLCIGLAKR 
DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 
SSILIGFVFFVIWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRLSGAVGLHHIIYPMFWYTELGGVETVAGQTVVGAQKIF 
30 FAQLADLAHSGLFrEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFITGITEPIEFMFLFVSPV 
LYVVHAFLDGVSFFIADVLNISIGNTFSGGVIDFTLFGILQGNAKTNWVLQIPFGLrWSVLYYIIFRWFITQFNVLTPGRGE 
EVDSKEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVICGGIQ 
AIYGAKAILYKNSiNEILGVDDZ 

35 MKFRKLACTVLAGAAVLGLAACGNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDI 
KVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPI 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASKVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGKLYEDKRMETISGWTQYYSP 

40 YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETIKKAMKQZ 

MQSTEKKPLTAFTVISTIILLLLTVLFIFPFYWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWM 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRIVNFMGIHDTLWAVILPLIGWPFGVFLM 
45 KQFSENIPTELLESAKIIX3CGEIRTFWSVAFPIVKPGFAALAIFTFIWWNDYFMQLVMLTSRNNLTISLGVATMQAEMA 
TN YG LIMAG AALAA VPIVTVFL VFQKS FTQGITMG A VKGZ 

MKIMFKNFNNILLNRKIVLLLRIVLMMIUNHLLSTAVQKQDAVIFFKRELISIFSYNDYSEANLEIPKLLLNI^LFMVGW 
50 LSVILLESDLADHYHHLIRYQSSSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYLI 
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 

MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAG 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG 
LTAFFLGLFTLGKKREQZ 

60 MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDLAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGT1EKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 
DQSKQLVTWVLPSII^ISLLL^VCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVS11SEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

65 IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREIARLDQAELESIVVWNRLLVYATLFGYADK 
VSHLMKVHQIQVENPDINLYVAYGWHSTFYHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGGSIGAFZ 



MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRTEAQTG 

EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQIIKVDGS 

EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 

RRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNTKDYSYLKL 

TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHITDGVPTMSYPINFNHAT 

FAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVYEKGAPAAFPVKPEKYSEMKAAGYA 

VIGDPINGGYIWLNWRESILAYPFNSNTAK1TNHGDPTRWYYNGNIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 

KPENYTNVTDTTKILEQLNRYFHTIVTEIGCSIENGTrrDPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAV 

QNDGGLIJCNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYW^ 

RDVRKYPEITISKEKKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 
KYRLFENSEPAGYKPVQNKPIVAFQrVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIG 
CMMMGGVLLYTRKHPZ 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 

VMFVWTOTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTYVGEDGA 

TLTGSKAVPIE1ELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 

TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 

VEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDKN 

TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 

VLANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF 

EWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGKDDAT 

KVVNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA2 



MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 

QIVRDLHSWDENKI^SFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 

TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEPL 

AGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 

GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

VWZ 



MSHIYLSIFTSLLLMLGLVNVAQADEYLRIGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKIAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIIAGMSPTAERKQEIAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAKITSQQGVYL 
YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKIL^ENWQQLLRG 

TAPLSENKVIYGLQKLVGWVLNVYIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTLAAIFIVSINTGAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVINIKDTSVLNVISVVELYFSGNTVATQTYQYFQTFTIIAV 
I YFVLTFTVTRILRFIERRMDMDT YTTG ANQMQTED LKZ 



MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNTVLEKGYDLTQYREK 
I^MVFQSFNLFENLNVLENTIVAQTTVLKRERTEAEKIAKENLEKVGMGERYWQAKPKQLSGGQKQRVAIARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
EFLQRYLKZ 



MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLVKTGHGYLFRIPRKKWLW 
YSILTVLVLVFQISFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 
LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTKSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ , 



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLA 
G VAAAKTTKTKQ VGFVGG IES E VIS RFEAGFKAG V AS V DPSIK VQ V D Y AGS FG D AAKGKTIAAAQ Y AAG ADI V YQ V AG 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTUCQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFNVGL 
PGQAUVGWI1^GWFAI^HPDMPIU>LMII^TIVIAL^ 

FMQSTDSTIRVGANATYQTPWl^ELTGNSI^NIGIFFAIIAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTHL 
SMnSGALAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
VTASIIFFVSVHYLIERFVKPKKQVKGGKZ 



MGVKKKLKLTSLLGLSLLIKfTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQVQ 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLKTGH 
FLWINLGSVDTTLVLPILAAVFTFI^TWLSNKAI^ERNGATTAMMYGIPVUFIFAVYAPGGVALYWTVSNAYQVLQTY 
FLNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTKZ 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERIIVVKPGDTIELKDMKIH 
AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFKIDVALNN 
YGENPVGIQDKMTSIDLLRMAENLRTKVHPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALL2 



MFLSGWLSSFANTYIHDLLGVLFPDSPFLNAFESAIAAPLVEEPLKLLSLVFVLALIPVRKLKSLFLLGIASGLGFQMIKDI 
GY1RTDLPEGFDFTISRILERIISGIASHWTFSGLAWGVYLLYRAYKGQKVGKKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTA1ALYGFYHAYCFVEKHNELMTZ 



MKVEPRCDVLSRMSHFFIR1LIMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY 
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
SPLEIALAEKLEAREMELYQQR5KAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV 
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKKWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQrRIESEVGKGTTVRIQFAQVNLVL 
EZ 



MELOTHNAEILL^AANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIPRGKWNKH 
ESAIKKFCLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 



MTKKQLHLVI\rrGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVEIKEDNPKLALVVDMRSRSFFSEIQAVLDEL 
ENQDGLDFKILFLDAADKELVARYKETRRSHPLAADGRILDGIKLERELI^PLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 



MRKPKITVIGGGTGSPVILKSLREKDVEIAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 

FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDN 

VYVTNALNDDTPLASRRVVQTILESDMrVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 

EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMR 

I 

IQVKKZ 



MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGIFIAVNYLPDLLLIFFGPVIDRVNPQKILIISILVQLAVAVIFL 
LLLNQISFWVIMSLVFISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLT1A 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSHSFGMPIGSLVAGTLIDLNIELVLLAISIPYFLFSYIFYTDNGLKEFSIYZ 



MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGSFLFKDRLIQQWKEIRKTKRKFFFGVLTGW 
LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 
GLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKENIYYPLLVHMLSNSLSLHLAISIVKZ 
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LKKPIIEFKNVSKVFEKNTKVLKDINFELEEGKFYTLLGASGSGKSTIL^ 

TVFQSYALFPHMh4VFENVAFPLRUUCIDKKEIEQRVAEVLKMVQLEGYEKRSIRKLSGGQRQRVAIARAIINQPRVVLLD 
EPLSALDLKLRTDMQYELRELQQRLGITFVFVTHDQEEALAMSDWIFVMNDGEIVQSGTPVDIYDEPINHFVATFIGESN 
5 ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLRITLPEEGKLQVKVDTQLFRGVHYEIIAYDELGNEWMI 
HSTRKAIVGEEIGLDFEPEDIHIMRLNETEEEFDAR1EEYVEIEEQEAGLINAIEEERDEENKL2 



MKSMRILFLlALIQISLSSCFLWKECIL^FKQSTAm 
10 LLGAVLVLSDNLFIK^4TLQQELVDFLLPSFFFLFGLDLUFLPLKKYVRDFLAMLDRKKTVLVTILATLLFLRNPMTIVSL 
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTLIFFZ 



MVKKIIGMVLALl^VTVVGVGVFAYTIYQQGTETLAKTYKKIGEETKVIEATEPLTILLMGVDTGKVERTETWV 
15 MILMTVNPICrKKTTMMSLERDU.TRIESGNGQAHEAKLNSAYADGGAELAIETIQKMMNIHrDRYVMVNMRGLQKLV 
DAVGGITVNNILGFPISISDQEEFNTISIGVGEQHIGGEEALVYARMRYQDPEGDYGRQKRQREVIQKVMEKALSLNSIGH 
YQEILKAl^DNMQTNIDl^AKSIPNLLGYKDSFKTrETQQLQGEGEILQGVSYQIVSRAHMLEMQhO-LRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDNZ 

20 

MKKQAYVIIALTSFLFVFFFSHSLLEILDFDWSIFLHDVEKTEKFVFLLLVFSMSMTCLLALFWRGIEELSLRKMQANLK 
RLLAGQEWQVADPDLDASFKSLSGKLNLLTEALQKAENQSLAQEEEIIEKERKRIARDLHDTVSQELFAAHMILSGISQ 
QALKLDREKMQTQLQSVTAILETAQKDLRVLLLHLRPVELEQKSLIEGIQILLKELEDKSDLRVSLKQNMTKLPKKIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIKERVEDMAGTVQLLTAPKQGLA 
25 VDIRIPLLDKEZ 



MrVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLITKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLI^GILVMTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVLALMLFFLDTKLGQAYIATGDNP 
30 DMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYADVSRGIGVrVVGLASLnGEVIFKSLSLAERLVTrVVGSIAY 
QFL V W A VIALG FNTS YLRL YS AULA VCLM IPTFKQTI LKG AK LSKZ 



MKKMKVWSTVLATGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNIKMVDIAG 
35 AQLADRVIAEKNNAVADMVFGIGAVDSNKIRDQKLLVQYKPKWLDKIDQSLSDKDNYYNPVIVQPLVLIGAPDVKEMP 
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYLKNAYTLQKGESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSKKQALAKEFIDWFGQSEIQVEYSKNFGSIPANKD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEV^VEKAELEYVQZ 

40 

MIKFDNIQIKYGDFVAIDNLNLDIHEGEFFTFLGPSGCGKSTTLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTNfTVFDNIAFGLKVKKVAPDVIKAKVSAVAAKIKISDQQLQRNVSELSGGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGITTLYVTHDQEEALTLSDR1AVFNNGYIEQVGTPVEIYHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYIRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESQILNVTSIDSQAAIRSVGESV 
45 ELFITPSDVLQFZ 



MRHKLNLKDWLIRLGLIWFLVTFIIYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSIMNSFKLAFSLIITVNVVG1L 
CVLFTEYFDIKGAKILKLGYMTSLIYGGVVLATGYKFVYGPYGLITKFLQNVIPSLDPNWFIGYGAVLFIMTFSGTANHT 
50 LFLTNTIRSVDYHTIEAARNMGAKPFTVFRKVVLPTUPTLFALTM 

NSRDLAALU^IILGIATTILLTIMNKIEKGGNYISISKTKAPLKKQKIASKPWNIIAHIVAYGLFTVFMLPLIFIVLYSFTDPV 
AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSFIYSIIAATTATILAVVFARVVRKHKSRFDFLFEYGALLPWLLPSTLLA 
VSLLFTFNQPQFLVLNQILVGSLVILLIAYIVVKIPFSYRMVRAILFSVDDEMEDAARSMGASPFYTMMKVIIPFILPVVLS 
VIALNFNSLLTDFDLSVFLYHPLAQPLGITIRSAGDETATSNAQALVFVYTIVLMHSGTVLYFTQRPGRKVRKZ 
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Table 3 

ID201 - 4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT 
TCATATAGCTATTTATTTGACCTTTCCTTTTTATTATATTCAACT 

TGTTT ACGGAGTATTTAAAGACT AAG ACATCTGATG AAATTC CAAG CTT ACTCCAGTCTTATTCAAAGTC CTTGACC 

ATATCTGCTC^CCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATGGAAA 

G CTAT CAAATTAT AT CGTGATGTTAG ATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAAC CGTGCAATTTGTTC 

ACGGGGTGGATGTCTACAAACyU^GCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTTTTGGTTACAA 

TTTTCCTTTGTTTTTTCrTATTTTTATACTAAACGCTTGCTCAATCCTCTTTTTTACATTTCAGAAGTGACTAGT^ 

AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAACAGATTA 

ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAATTGCAAAAT 

CAAAAGGTTTCCTTTGTCCG CGGAGCATCACATGAGTTGAAAACC C CTTTAGCCAGTCTTAGAATTATCCTAGAGAA 

TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGATTGACCAGATGA 

GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGACCTTGACTGTTAAG 

CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTGAAAATCAATTGAC 

AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTAGTAATGCAATTAAAT 

ATTCAGAT AAAAATGGGCGTGT AATCATATC CGAG CAAG ATGG CT ATCTCTCTATCAAAAAT ACATGTGCGCCTCT A 

AGTGLACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGATAAGGATGAAAGTTCCGG 

TTTGGK3TCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGATTATAGTTTTCTCCCTTATGAACACGGTA 

TGGAATTTAAG ATTAG CTTGT AG 



MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLT 
ISAHLKRDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYK^ 

FSFVFSYFYTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQN 
QKVSFVROASHELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETLTVK 
PVLVDI LSRYQELAHS IGVTIENQLTDATRWMSLRALDKVLTNLISNAIKYSDKNGRVI ISEQDGYLS IKNTCAPL 
SDQELEHLFDIFYHSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
C ACTG CTC AAGAAAAACATCAAACCCAAGCT AGCTCAACTGTAG CG CTTGGTCGAACTCTTATCGCTAG CC AGATTC 
TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTATTATCACC 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAAGACTGCGACTGGTGA 
AGTC CT AGTCG GAC CTTTTGTTGG AAATGGTC AATTCCTCGTTAT CACAGACT ACGGT ACTGGAAATCCTTACAACT 
CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTCA 
GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCTTGCC 
AGGAGCC^GAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCTACTCTTCTCGAAA 
GCGACGACCATATCGAAGCCCTCCTCAAGGCTATCTACGGGGACGAAGCCTACAAGCGTCTTTCTGAAGAAGAAATC 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAAAT 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCTGG 
AGGAACTCATTCGTGACAAATCTTAA 



MDKI I KTISESGAFRAFVLDSTETVRTAQEKHQTQASSTVALGRTLIASQILAANEKGNTKLTVKVLGSSSLGAI IT 
VADTKGNVKGYVQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESQQTPS 
AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKRIQEMPAISTLLESDDHIEALLKAIYGDEAYKRLSEEEI 
RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEITCQFCQTTYNFDEKDLEELIRDKS2 



ID203 - 4115 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCATCATT 
TGTATCTG CTCAAG AATCATCAACTTAC ACTGTT AAAG AAGGTG AT ACACTTTCAGAAATCG CTGAAACTCACAACA 
CAACAGTTG AAAAATTGG C AG AAAACAAC CAC ATTG ATAAC ATTCATTTG ATTTATGTTGAT C AAG AGTTGGTT ATC 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 
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AG CT CCAGTAGCAG AAACTC CAGTAGTAAGTGAAACAGTTGTTT C AACTGTAAG CGGATCTGAAGCAG AAGCCAAAG 
AATGG ATCG CTCAAAAAGAATCAGGTGGTAGTATACAG CTAC AAATGGACGTT AT ATCGGACGTTAC CAATTAA 

5 MKSITKKIKATIAGVAALFAVFAPSFVSAQESSTYTVKEGDTLSEIAET^ 

DGPVAPVATPAPATYAAPAAQDETVSAPVAETPWSETWSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTNZ 

ID204 - 4117.1 

10 ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTTAATGAAGAAAGTAAGATTTAT 
TTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGT^ 

ATCTGAAAGAAGATGG CAGTCAAGCAGCAAATGAGTGGGTTTTTGATACT CATTATCAATCTTGGTTCT AT AT AAAA 
GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCTC7VAATCTGGTGGCTATAT 
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGA 

15 GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCT 
TGGTTTTATATCAAAGCAGATGGACTVGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATT^ 
ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAGCAAG 
GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATT 
TTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATC 

20 TTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACT 
ACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGAT 
GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACA^ 
G AC AGC CAATG AATGGATTTGGGAT AAGG AATCTTGGTTTT ACCT CAAAT CTGATGGGAAAATAGCTGAAAAAG AAT 
GGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGGGAAAAATGAGACAGTAGAT 

25 GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGT 
GCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAG 
AT AAGG ATAGAAAAAGTGATGACAAGCG CTTGGCTATT ACTATT TCTGGTTTGTCAGGCTATATGAAAACAGAAGAT 
TTACAAG CGC TAGATGCTAGTAAGGACTTTATCCCTTATT ATG AG AGTGATGG CCACCGTTTTTATCACTATGTGGC 
TCAGAATGCTAGTATCCCAGTAGCTTCTCATCrTrTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCC 

30 TG CATTTTGATGGTTTTAAGCTTGAGAATC C CTTC CTTTTCAAAGATTTAACAGAGGCT ACAAACTACAGTGCTGAA 

GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGC 
CGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAA 
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGAT 
GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGG 

35 AAACAAGGCTTCTGGT ATGAATGTGGAATATG CTTCAG ACCCTTATTGGGG CGAAAAAATTG CTAGTGTG ATGATGA 

AAATCAATG AG AAGCT AGGTGG C AAAG ATT AG 



MNLGEFWYNKINKNRGRRLMKKVRFIFLALLFFLASPEGAW 
40 ADANYAENEWLKQGDDYFYLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDA 

WFYIKADGQHAEKEWLOIKGKDYYFKSGGYLLTSQWINQAYWASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWI 

FENGHYYYLKSGGYMAAlJEWIWDKESWFYLKFDGia^AEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSD 

GKIAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWV^ 

GYQ1XSSDGKWLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQG 
45 LQALDASKDFI PY YESDGHRF YHYVAQNAS I P V AS H LS DME VG KK Y Y S ADG LH FDG F KL EN P FL F KD LTEATN YS AE 

ELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFD 

DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

ID20S - 4118.1 

50 ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGGTG 
. ACAAAATTGACC TTCATAGT ATCGTT CCGATTGGGC AAGAC C CAC ACG AATACGAACCACTTCCTGAAGACGTTAAG 
AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATT 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG 

55 GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATC 
GC CAAAC AATTG AGCGC CAAAG ACCCTAAC AAT AAAGAATTCTATGAAAAAAATCTCAAAG AAT AT ACTG AT AAGTT 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAAG 
GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAGAAGAAGGA 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATCACTCTTTGTAGAATCAAGTGT 

60 GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACTCTATCG 
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CAGAACAAGGT AAAGAAGG CG ACAGCT ACTAC AGCATG ATG AAAT ACAACCTTGACAAGATTGCTGAAGG ATTGG CA 
AAATAA 



MKKLGTLLVLFLS AI I LVACASGKKDTTSGQKLKWATNS I IAD I TKNI AGDKI DLHS I VP IGQD PHEYE P LPEDVK 

KTS EANLI FYNGI NLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVI YLEGQNEKGKEDPHAWLNLENGI I FAKNI 

AKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG 

TPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGI^ 

KZ 

ID206 - 4119.1 

ATGGAATGGTATAAAAAAATCGGACTTCTTGCAACTACAGGTTTAGCTT^ 

TGGTAAATCTGCGGATGG CACAGTG AC CAT CG AGTATTTCAACCAGAAAAAAGAAATG ACCAAAACCTTGGAAGAAA 

TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTATTG 

AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGGGC 

AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATG 

CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACTACAAC^AAGATAAATTCGAA 

GAAC TGGG CTTG AAGGTT C CTGAAAC CTGGGATGAATTTGAAC AGTT AGT CAAAGATATCGTTG CT AAAGGACAAAC 

AC(^TTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTT^ 

GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAAT^ 

GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT 
TATCGGAGCCTTCG CACGTGGGGATGTCCT CATGACAC CAAATGGGTCTTGGGCG ATCACAGCG ATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCC^GGAAAAGAAAAAGGACAAAGCTTAACCGTTGGTGCGGGAGAC 
TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATGACCCGTCCAGA 
AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCAC 
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCA 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
TAAC CCGATGAAAG CGG ATGTGGATTAG 



MEWYKKIGLLATTGLALFKUXSACSbTYGKSADGTVTIEYFNQ 

KTRVLAGDVPDWNIYPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNVPFTANAYGIYYNKDKFE 
ELGLKVPETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGG^ 

DIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKGQSLTVGAGD 
LAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGS PTAIEGVKQAGEDSPLAGMTEYAFTDRHLVWLQQYWTSEA 
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ 

ID207 - 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCTGGGAGATGAACTAGCAGCTTT 
TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC 
CACGATGGCTTTGTCCTTTATCCGA(^GCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 
TCAAATCCTAACTATGGG AATGCTGGTAAGTTCG CTGAGGTTTGGATGGATGGTG CCAG AGQAGAGGGCGCG CAAAA 
GGTT AATT ATG AATTTG AAAAATGGTTTGAAACC ATT CGTGACCTGCAGGGCGATTG CTTGATTTTTT CAACAG AAG 
GCACC^GTATCCGCTGGArTGGCAATGAACGAGGGTATGCAGGTGATCC^CTGTGGCAAAAGGTGAATCCTGATAAA 
CT AGG AACAGAAGCAGAGCTGAACTAT CTTCAG C ACGGGGATCCCTCGGGCACG ATTTTTT CAATCGGAG AGG CAG A 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 
TTCACTCAGTAGGGCGAGGAACrCCACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT 
ATTGAACG ACTTTATG AATTTGCG ACCTAT CG C AATGAGCTCT ATAAAG AAGATTTGGCTCTGGG AG CTG AGGTATC 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAGAGAAGATTTG 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 
TACTGTTG GTTACAAACGT CTCTT ACGAGG AGCAGTTGTTGAGGCACAGAAG ATACGTGT AGT CATT AC AGAATCAC 
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGTTCAGGAACTA 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATGTAGTGG 
TCCTTTAGAAGCTAAGATTTCGATTCAACCGGGGACAGGTGTCCATGGTGTCGCCTATCAGGATGAGATTCAAGTCC 
TTGCGTTTCAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCGCAGGAGATAAAACCTTGGAT 
TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA 
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MKKIKPHGPLPSQTQLAYLGDELAAFIHFGPNTFYDQEWGTGQEDPERTO^ 

HDGFVLYPTAHTD YSVKVS PWRRGKGDLLLEVSQAATE FDMDMGVYLS PWDAHS PLYHVDREADYNAYYLAQLKE I L 
SNPNYGNAGKFAEVVMDGARGEGAQKVNYEFEKWFETIRDLQGDC^ 

LGTEAELNYIiQHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD 
IERLYEFATYRNELYKEDL.ALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAWEAQKIRWITESQALPLLTKISLYKTPGLSKKEW 
AFAEKSLAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQVLAFQTGETEKSLTLPTLYFAGDFCTLD 

FYLNLTVDGQLVDQLQVQVSZ 



ID208 - 4125.12 

ATGCTTGAAAGACTGAAAAG AATAC ATTAT ATGTTTTGG ATC AGTTT AATTTTTATGATTTT CCC CATCCTGTCTGT 
AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 
CTAAGAGCCAGCG CCTATCTTGGCTATATTGGGG CCT CATG CTG ACTTATGT AGTTGGGAAT ACTG CCTTTGTTGCT 
15 GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT 

ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCITGTTGTGGGGCAACT 

TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATTGTCGAG 
GATTTGAAAGAAGCTCAGGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATGAACGTAGTCGTATCGG 
TCAGGATTTGCATGATAGTCrGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGCT^GTTATTTC 
20 AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 

MLERLKRIHYMFWISLIFMIFPILSVVTGWLSAWHLLIDILFWA^ 

VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLLAQVLWGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE 
25 DLKE AQ VKQNAQ I NLL LAEN E RS R I GQD LHD S LGHT F AMLS V KTD LALQ L FQME A Y PQVE KE LKEIHQISKDPZ 

ID2Q9 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATG CAGGGCAG ACAGATGCCTCG CAAATTGAAAAGGCGGCAGTTAG CCAAGGAGG AAAAGC AGTG AAAAAAA 
30 CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT 

TAAC CAAACAGGTCATGCAGAAAC CGT C C ATGTCAC CTATGATG CCAAGCAAATTT CTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTT 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAAC 

3 5 AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAA 

ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGC^VAATATCCAAAACCAAGTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAA 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
ACAGG AC AAGGGCGG CTT ACGTT ACTGT AT CAATAG C CTCTCTAT CCGCTTT ATTC CCAAAGACCAAATGG AAG AAA 
AAGg CT ACGCTTATTTACTAGATTATGTTGATTAA 

45 MNDKLKIFLLLGVFFLAITGFYVLLIRNAGQTDAS 

SRVPGVTDAVSG YANGRGETTKYELINQTGHAETVHVTYDAKQI SLKE I LLHYFRI INPTS KNKQGNDVGTQYRTGV 
YYTDDKDLEVINQVFDEVAKKYDQPIAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSD 

E ELKKTLS P E E Y AVTQ ENQT E RAF SNR YWDKFES G I YVD I ATGE P L FS S KD K FE S G CGW P S FTQ P I S P D WT YKE D K 
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVDZ 

50 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCT 

CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTTGAAGAAAAAT 
ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAAAGAAGTTCCT 

55 GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTCAAA 
AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATGGTAGTGTTT 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCTGAACTAAAAGGA 
AAAATCGCAACTGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTCAATTAACAAATATGCTACAAGCTCAAGGTGG 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGATCTTTTCACACTTATTGATGGTAAAATCGGTTCAAGTTCAT 

60 ctAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 
AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCTCTCAAGAAGTACAAGATACACTTGGTA 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGACAAAATCAAAACA 
CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCTTTACAGATAT 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLWYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP 
VADVIFGGSYTQYTTHGELFENYTSKENDNVIKEYQNTTGYSTPYTLDGSVLIVNPDLTKGMNIEGYNDLFKPELKG 
KIATADPANSSSAFAQLTNMLQAQGGYKDDKAWSWKDLFTLIDGKIGSSSSSVYKVVADGEI^VGLSYEDPAVKLL 
NDGANI KWYPKEGTVFLPASAAIVKKSKNMENAKKFIDFI ISQEVQDTLGTTTTORPVRKNAKTSENMKPIDKIKT 
LTEDYDYVIKNKSDIVKKYNEVFTDIQSKQZ 

ID211 - 4127.2 

ATGAGTGAGATCAAAATTATTAACGCCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTGAACATTACAAT 

TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCTTCTTCGTATGATTGCAGGTT 

TCAACAGTATCGAAGGTGGAGAATTTTACTTCGATGATACAAAAATCAATAATATGGAACCCAGCAAACGCAATATC 

GGGATGGTTTTCCAAAACTACGCTATTTTCCCACATTTGACTGTCCGAGACAACGTTGCTTTTGGTCTTATGCAAAA 

GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTATCTTGAACTCATGCAAATTGCTCAATATGCGGATC 

GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTAATCCAAGTGTT 

CTCCTCATGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCGAGAAATCCA 

ACACGAAGTGGG AATTACAACTG TTTATGT AACCCACG AC CAAG AAG AAG CCATGGCTATTTCAGACCAAATTG CTG 

TTATGAAAGATGGGGTGAT C CAACAAATCGG CCG AC CAAAAG AACT CTAT CAT AAACCAGCTAATG AGTTTGTGGCA 

ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCITATATCGTCTTTTCAGA 

TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 

TTATCAAAGATGAATCTCGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACrAAA 

ATTG AG ACAGGTTTTG CCTCAAAAATTCAAGTT AGTGAAGAATCAACTTT TG AAGAAG ATCTACAAAAAGGCAATCG 

TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGCAGATGGTTCCCAAAACCTGATAAAAGGAGTCA 

ACCATGGAACGTAA 

MSEIKI INAKKI YHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLJU^IAGFNSIEGGEFYFDDTKINNMEPSKRNI 

GWFQNYAIFPHLTVRDOTAFGLMQKKVPKEELIQOTNKYLEU^QIAOYADRKPDKLSGGQQQRVT 

LUVIDEPLSNLEAKLRLDMRQAIREIQHEVGITTVY^ 

TFIGRTNI IPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVSIRPEEFIKDESGDIEGTIRDSVYLGLNTDYF 
I ETGFAS KI QVSEESTFEEDLQKGNRI RLRINTQKLN I FS ADGSQNL I KGVNHGTZ 

ID212 - 4136.1 

ATGAAG AAAAAATTATTGGCAGGTG C CATCACACTATT ATCAGT AGC AACTTTAGCAG CTTGTT CGAAAGGGTCAG A 
AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCTT 
CAGCCCAACAAGTCTTGTTAAAT ATGAC CATC CAAAAAGTTTTTG AAAAACAATATGG CTCAGAGCTTGATGAT AAA 
G AGGTTG ATG AT ACTATTGC CGAAGAAAAAAAACAATATGGCG AAAACTACCAACGTGTCTTGTCACAAG CAGGTAT 
GACTCTTGAAACACGTAAAGC TCAAATTCGTACAAGT AAATTAGTTGAGTTGGCAGTTAAG AAGGTAGCAGAAG CTG 
AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAAT 
AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAA 
TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGAGCAAG 
TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGCACACAAGCCTACAGT 
AGCCAATATTACATTGTAAAACTCACTAAGAAAACAGAAAAATCATCTAATATTGATGACTACAAAGAAAAATTAAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCATTATCGGAAAAGAATTGCAAGCAGCCA 
ATATCAAGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATATCGGTGGTGGAGATTCAAGCTCAAGCAGT 
AGTACATCAAACGAATAG 

MKKKLLAGAITLLSVATLAACSKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDK 
EVDDTIAEEKKQYGENYQRVLSQAGMTLETRKAQIRTSKLVEIJVVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLN 
NEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVDGVSDVITATGTQAYS 
SQYYIVKLTKKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQAANIKVKDQAFQNIFTQYIGGGDSSSSS 
STSNEZ 
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ID213 - 4137.3 

ATG AAAAAAAAT ATT AAA C AAT ATG T AAC CT T AGGT ACTG T AG TGG T ATT AT CAGCATTTGT TGCT AAC T CAG TTG C 
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTC^ 

AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTCGAACAGTTGAAACTACTGTTAAGGATCCA 
TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACriTAACAGGGGGCGGAGAAAATGTTACTAA 
AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAAGGCITAC 
AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGATGTGTTCCTTAGAGACAATGGT 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGTTTGGGGTAA 
GT ACAAACAAG AGG CTGTGACTAACACTGTTGC AGTAGT AGCAGATTCAGT CAAAAAAACAT ATTCTTTAT ACGCAA 
AT GGT AC AAAAG T AGT AG AAAAG AAAG TGG AT AATTTC CT AAACAT CAAG GAT ATTAAAGGT ATTGATT ACT AT ATG 
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGAAAATATCAAAT^ 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGTTACTGGA 

CAACAGGTTCTAACTATTTCCGTATrCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTCAAGCATTGACGCT 
CGTT ACGG TGG AA CTCATGATTT CTTGAAT AAAATTAAT ATTG CT ACAAG T T AT AGTG ATG AT AATGG T AAG ACATG 
G ACT AAAC CAAAATT AACATTGG CATT CG ATGATTTTG CG C CAG T AC CATT AG AATGG C CT CG TG AAG TTGGTGG AC 
GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACTCATG 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATT 
TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTGATTACAATTATACTATTCGTGAGAATGGTACTGTATACG 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 
GTAGAGCGG 

MKKNIKQWTLGTVWLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDNSSEVTVQPRWETTVKDP 
SSTAEETPVLEKNNVTLTGGGEWTKELKDKFTSGDFTWIKYN^^ 

ELGMEIARDTSSNKIWLVSRP ASVWGKYKQEAVTNTVAWADS VKKTYS LYANGTKWEKKVDNFLN I KDI KGI DYYM 
25 LGG V KRAG KT A FG FNGT L EN I KFFNSALDEETVKKMTTNAVTGHLI YTANDTTGSNYFRI PVLYT FSNGRVFS S I DA 

RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTLAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 
FADVMPAGVS FREATR KDSG Y KQ I DGN Y Y L KLRKQGDTD YN YT I RENGTV YDD RTNR PTE FS VDKN FG I KQNGNYLT 
VER 

30 ID214 - 41Q5 

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCTACACCCAAAGAGAC 
TAG CG CTC AAAAGACAATCGTC CTTGCTACAGCTGG CG ACGTG C CACCATTTGACT ACGAAG ACAAGGGCAATCTGA 
CAGGCTTTG AT ATCGAAGTTTT AAAGGCAGT AGATGAAAAACT CAGCG ACTACGAG ATTCAATT CCAAAGAAC CGCC 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 

35 TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTT 
CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCA^ 
AAT CAG AAACAC ACTGAT AATCCCG CT AC AATTAATTTTTCTGGTGAGG ATATTGGTAAACGAATCCTAGACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGTTTA 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 

40 CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC 
TTACCT CCC AGAT CAATCTCAGTTACAAT AA 



5 
10 
15 
20 



MKKFSLLLAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
45 WESIFPGIJDSGHYQAAA^LSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLI^IAGKTTQEDTGTSNAQFINNW 
NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKI IKDRGLDLSWDLPSADSPSNYI IFSSDQKEFKE 
QFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ2 

ID215 - 4211,1 

50 ATGAAAAAAAATAGTTTATATATCATATCCTCACTCTTTTTTGCTTGTGTCTTATTTGTCTATGCTACGGCGACGAA 
TTTTCAAAAC AGT AC CAGTG CT AGGCAGGT AAAAACGGAAACCTAT ACTAAT ACAGT AAC AAATGTCC CTATTG AC A 
T ACG CT AT AATAGTG ATAAGTATTTTATTAGCGGTTTTG CTTCAGAAGTATC AGTGGTCTTGACTGGTG CAAATCGC 
CTATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGGTGTTGG 
AACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCCGCAAAAAATTACAG 

55 TCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG 
GTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 
TAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAACAGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCG 
ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCA 
CCAAGTTCAAGCACATCAAATTCAAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAA 

60 AACGAATTAA 
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MKKNSLYIISSLFFACVLFVYATATNFQNSTSARQVKTET 

LSLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDKVKIVPEIDPSQIDSR 
VQIENVMVSDKEVSITSDQETLDRIDKI IAVLPTSERITGNYSGSVPLOAIDRNGWLPAVITPFDTIMKVTTKPVA 
PSSSTSNSSTSSSSETSSSTKATSSKTNZ 

ID216 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGTTGGAGGAAATTC 
TGCTTTTGCAATTATGGCGATTATCATTGCCTTGGCAATTT^ 

TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGC^^ 
TACGGAATTATCTTTATCrCTGTTTTACCTCAAATCTACTTAATTTATACCTCTTTCCTAAA 

AT CTGTTAAAGGTT ATTCTC CAAAC AGTTACAAGGTAGCTTTCCAT CGTATGGGATCTG CTATTTTCAATACCATTC 

GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCTACCTAGCCGTTAGAAAACGGAAT 

TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCTAGGGATTGCCTTC 

TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGCTTTCATCTTGATTATGTCTC 

TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTIT^ 

GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATGCTATCTGGTATCAT 
TTCTGGAG CCAT CTTATCTTGA 



MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAI IIALAIFLIQKHIANRYSFSMNLLHPI^ 

YGIIFISVLPQIYLIYTSFLKTSGMVSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLWLFATFISYLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSARRLPYTIRSSVASLQQIAPSIEE 
AAESLGSSRLNTFAKITTPMMLSGIISGAILSZ 
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Table 4 
ID301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCT^ 

TACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAAGAA^ 
CAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCAGAAGACGCTC^ 

GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGT 

TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGTAAATATAAATCTGACGC^ 

AAT T AAC AGAG GT CG ACT CT AAAAT AGAG AAGG CT AG G AAAG AG C AACAGG ACTT G CAAAAT AAATTT AATG AAGT A 

AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGC TG AGACT AAG AAAAAAGCAGAAG AAG CTAAAGCAGAAG AAAA 

AGTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTG 

AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACT^ 

AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTT ATAGAAG CTAAATT AAAAAAAGGAGAAG CTGA 

GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGA 

GTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAAC^ 

GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATCCTGAAGATGATACTGCTC 

TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACTTG 

TTGAT C C TGAAG GTAAGACT CAGGATGAATT AG AT AAAG AAG C AG AAGAAG CTGAGTT G GAT AAAAAAG CTG ATGAA 
CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATTCT 
TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACTCAAAAAGAATTAGATGCAG 
CTGTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCT 
G CACCAAAACCAGAGC AACCAGCTCCAG CTCCAAAACCAGAGCAACCAGCTCCTGCAC CAAAACCAGAG CAACCAG C 
TCCAG CT CCAAAACCAGAGCAACCAG CT CCAGCTCCAAAACCAGAGCAAC CAG CTAAGC CGG AGAAACCAGCTGAAG 
AGCCTACTCAAC CAGAAAAACCAGC CACTCCAAAAACAGGCTGGAAACAAGAAAACGGT ATGTGGTATTTCTACAAT 
ACTGATGGTTCAATGG CAATAGGTTGGCTCC AAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCG CTATGGC 
AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCA 
AAGT AT CAGAT AAATGGTACT ATGTCAACAGCAATGGCG CTATGG CGACAGGCTGG CTCC AAT ACAATGGCTCATGG 
TACT ACCTCAACGCTAATGGTGAT ATGGCGACAGGATGG CTCCAAT ACAACGGTTCATGGT ATT ACCTC^ CTAA 
TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAG 
GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGA 
GATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAG C CAATGGTTCAAAGTATCAG AT AAATGGTACT A 
TGTO^TGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTTT 
AA 

MNKKKMILTSLASVAILGAGFVTSQPTFVRAEESPQWEKSSLEKKYEEAKAKADTAKKDYETAKK 
DQKRTEEKARKE AEASQKLNDVALWQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDS KI EKARKEQQDLQNKFNEV 
RAVWPEPNALAETKKKAEEAKAEEKVAKRKYDYATLKVA^^ 
KKLIAGADPDDGTETVIRAKLKKGEAELNAKQAEIJ^^ 
VADLEKEISNLEILLGGADPEDDTAALQNKLAAKKAEIA 

LQNKVADLEKE I SNLE I LLGGADS EDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAP 
APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYN 
TDGSMAIGWLQNNGSWYYLNANGAMATGWVKDGDTO^ 

YYLNANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGSWYYI^ANGAMATGWAKVNGSWYYLNAN 
DTW Y Y LE ASGAM KAS QW FKVS D KW Y YVNG LG ALAVNTTVDG Y KVNANG EWV Z 

ID302 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTGGAGTAGCTAGTGTAGTTGT 

TGCC^GTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCACTTCI^CTAATA 

GGGCAAATGAAAGTC AGGCAGAACAAGGAGAAC AACCTAAAAAACTCGATT C AG AACG AGATAAGG CAAGGAAAGAG 

GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGC 

TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCC 

AG ATACTGATGATGG AGAGTCG ATCAAAAGT AG ATG AAG CTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGT CA 

AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACJ^ 

AGCAG AAG CT AAG AAG AAG GTTG AAG AAG C TG AG AAAAAAG C CAAG G AT C AAAAAGAAGAAG AT CGTCG T AACT ACC 
CAACCATTACTT ACAAAACGCTTGAACTTGAAATTGCrrGAGT CCGATGTGG AAGTTAAAA CTTGAACTA 
GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGC 
TGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTA 
AAG AG CAAGGTAAACC AAAGGGGCGGGCAAAACG AGG AGTTCCTGG AGAG CTAG C AAC ACCTGAT AAAAAAG AAAAT 
GATG CGAAGTCTTCAGATT CTAG CGT AGGTGAAG AAACTCTT CCAAGCC CAT CC CTG AAAC C AGAAAAAAAGGT AGC 
AGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAA 
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CCAAT ACTTACAAAACG CTTGAACTTG AAATTGCTGAGTCCG ATGTGGAAGTTAAAAAAG CGGAGCTTGAACT AGT A 
AAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTGAGAGTAAAAAAGCTGA 
GG C T AC AAGGT T AG AAAAAAT C AAG ACAG AT C GT AAAAAAGC AG AAG AAG AAG CT AAACG AAAAGCAGCAG AAG AAG 
ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCT 
5 AAAC CAGAGAATCCAGCTGAACAACCAAAAGCAG AAAAAC CAG CTGATCAACAAGCTGAAGAAGACTATGCTCGTAG 

ATC AG AAGAAGAATATAATCGCTTG ACTCAAC AG CAAC CGCCAAAAACTGAAAAACCAGCACAAC CATCTACTCCAA 
AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCC^ 
AACAATGG CTCATGGTACTAC CTCAACAGCAATGG CGCTATGG CG ACAGGATGG CTCCAAAACAATGGTTC ATGGT A 
CT AT CT AAACG CT AATGG TTCAATGG CAACAGG AT GG C T C C AAAACAATG GTT CATGG TACT ACC T AAACG C T AATG 
10 GTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGCT 

TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 
TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 
TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 
TAA 

15 

MFASKSERKVHYSIRKFSVGVASVWASLVMGSVVHATENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKE 
VEEYVKKIVGESYAKSTKKRHTITVALVNELNNI^^ 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKIX3KEEDRRNYPTITYKTLELEIAESDVEVKKAELEL 
VKVKANE PRDEOKI KQAEAEVES KC/AEATRLKKI KTDREEAEEE AKRRADAKEQGKPKGRAKRGVPGELATPDKKEN 
20 DAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRN 
KEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRK^ 

KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTCK2QPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQ 

NNGSWYYI^SNGAMATGWLQNNGSWYYLNANGSMATGWLQl^GSW^^ 

WLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMKASQWFK^^ 

25 z 

ID303 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTC 
GTATCCTGTATCTGTTACATCTAACCAAGGAACAGATGTAGCAGTAGAACCAGCTAA^ 
30 ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGC^UVCAGGTTGGGTACAAGTTAAT 
AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGCT 

TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGC 
GTTAA 

35 MVKRRIRRGTREPEKVWPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVN 
SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVRZ 

ID304 

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAAT 
40 AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCT AAGGT AGTAACCAAGAAAATTT 

CAAGTACGCGAATTCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGAT 
GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACT 
AAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCCATTTGCTA 
AAT AT G AAG CAG ACAATGAC C TTT CTG CAGGA C AGG AG CAAGAG ATT ACT CTAGG AAAG AATGGGAAAAC AG T T ACA 
45 ACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTACTTTAAGTCAAAAAAAAGACTCTCAAAC 
AAGAGTTGTTAAAAAAAGAACCAAGCCCCAAGTTCTTGTCCAAGAAATTCGAATCGAAACA^ 

CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGTAG 

LOTSFVHAADGIQYVRDDTRDKEEGIEYDDADNGDIIVKVATK^ 
50 GWTTTRTYDVNPETGYVTEQVTVDRKEATDTVIKVPAKSKVEEVLVPFATKYEM^ 

TI TYNVDG KSGQVTESTLSQKKDSQTRWKKRTKPQVLVQE I P I ETEYLDGPTLDKSQEVEEVGE IGKLLLLQS I LZ 



ID305 

55 ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTTTCTTCGGTTT 
TGCAGATGATTCTGAAGGATGGC^GTTTGTCCAAGAAAATGGTAG^^ 

CCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGATGGTTGTCGGCTGGCAATAT 
ATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCCAAGAATAGAGATTGCTCTTAGACCAGATTGGTTTTA 
TTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCAAGTTTTAGAAGC 
60 AACATCATGGGGAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACT 
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TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAA 
CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 
CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTAC 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCT CAACTTGGTACTATCTAGATG CTGAAAA 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATC^ 

GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAAC^GGCTGGTTCCAAGT^ 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACTA 
TAATGGTGAATGGGTTAAGTAA 

10 MKLLKKMMQIALATFFFGLIATimfFADDSEGW 

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHT 
LKTGWIYEEGHWYYLQKDGGFDSRINRLTVGEIiARGWVKDYPLTYDEEKLKAAPWYYLNPATG 
YLHSSGAMATGWYKEGSTWYYI^AENGDMRTGWQNLGNKWYYLRSSGAI^^ 
NGNWYYAYDSGAIiAVNTTVGGYYLNYNGEWVKZ 

15 

ID30 6 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAACTAAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTA 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
20 TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC 
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 

25 

LAGR YG S AVQ CTE VT ASNLSTVKT KATWE KP LKD FRAS T S DQS GWVE S NGKW Y F Y ESG D V KTGWV KTDGKW YYLND 
LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKJENGTWYYLDEAGIMKTGWFKVGPHW^ 
AYGSGALAVSTTTPDGYRVNGNGEWVNZ 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGAAA 
C CTACTGGCGAGTGATTGATGGTAAGT ACTATTATTTTGATT CT CTATCTGG AGAGATGGTTGT CGGCTGG CAAT AT 
ATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTTTCCAAAGTCAGA 
GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA 
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAAT 
TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATC 
CAACAACTGGT ATTATGCAAACAGGTTGG CAATATCTAGGTAATAAGTGGTACTAC CTCCGTTCCTCAGG AG CAATG 
GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA 
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIIX3KYYYFDSLSGEMVVGWQY 
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQNHSLETGWLYTJQSNWYYLAKTEINGENYLGGERRAGWINDDSTW 

ATGWYQ EGTTW YYLDHPNGDMKTGWQNLGN KW YYLRS SGAMATGW YQDGSTWYYLNAGNGDMKTGW FQVNGNW Y YAY 
SSGALAVNTTVDGYSVNYNGEWVRZ 



ID3 08 

ATGAAAAAGAAATTAACTAGTTTAGCACTTGTAGGCGCTTTTTTAGGTTTGTC^TGGTATGGGAA 
AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGATGCGATTATTCT^ 
55 GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTT ATTTTGGTGAC CCATACCCAC AGTG ATCATATTGG AAATGTTGATGAATTACTGTCT AC CTAT CCAGTTG 
AC CG AGT CT ATCTT AAGAAATATAGTG AT AGT CGT ATTACTAATTCTG AACGT CT ATGGGATAATCTGT ATGG CT AT 
GATAAGGTTTT AC AG ACTGCTG CAG AAAAAGGTGTTTCAG TT ATT CAAAAT AT CACAC AAGGGG ATG CTC ATTTT CA 
60 GTTTGGG GAC ATGG ATATTCAG CTCTATAATT ATGAAAATGAAACTGATTCAT CGGGTG AATT AAAGAAAATTTGGG 



30 
35 
40 
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ATG ACAATTCCAATTCCTTGATT AG CGTGGTG AAAGTC AATGG CAAGAAAATTTAC CTTGGGGGCGATTT AG ATAAT 
GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAATCATCACCATGATAC 
CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCAAACTTCGGATAGTCTACCTT 
GGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGAACGAGGAATTGAGAGAATCAACGCAGCCAGCAAA 
5 GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAG 
TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTG 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCTTCTGCT 

TTATTTTAACAAAGAAAAC CAG ATGGAAATTGGTTGG ATT C AAGAT AAAGAGCAGTGGT ATTATTTGG ATGTTGATG 
1 0 GTTCTATGAAGACAGGATGGCTTCAATATATGGGG CAATGGT ATTACTTTG CTCC ATCAGGGG AAATGAAAATGGGC 

TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGATTC 
AGACAGACGGTTC7VCGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTCAATTA 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 
15 ATCTGCTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAATCCAAAGAAGTAGTGAAAAAGGATCTTGAA 

AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATCCTCAACTTCACAAAGCT 
GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKKKLTSLALVGAFLGLSWYGNVQAQESSGNKIHFII^QEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWRE 
20 GIETSYKHVLTDRVFRRLKELGVQKIiDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLVro 

DKVLQTAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISVVKVNGKKIYLGGDLDN 
VHGAEDKYGPLIGKVDLMKF^^^HHDTNKSNTK^ 

D YD ATV FD I RKDG FVN ISTSYKPIPS FQ AGWH KS A YGNW W YQ APDS TGE Y AVGWNE I EG E W Y Y FNQTG I L LQNQW KK 
WNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDVDGSMKTGWLQYMGQWYYFA 
25 WKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKANDWYFYCT^ 

LVNGKTPEGYTVDSSGAWLVDVSIEKSATIKTTSHSEIKESKEWKKDLENKETSQHESVTNFSTSQDLTSSTSQSS 
ETSVNKSESEQZ 

ID3 09 

3 0 ATGGAAATTAATG1X3AGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATATAGGCAAGTACACGCACA 
CTCAACTGGGAAT CCGCATTCAACCGTAC AGAATG AAGCGG ATT ATCACTGGCGGAAAG ACC CAGAATTAGGTTTTT 
TCTCGCACATTGTTGGGAACGGTTGCATGATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGACTACCG 
CCTTTATATCGAACTCTTACGCAATCTAG CAGATG AAGCAGGTTTG CCGAAAACGCTTGATACAGGGAGTTTAGCTG 

35 GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 
G CT AAATGGGGCATTAG CCGTGAGCAGTTTAAGC ATG AT ATTGAG AACGGCTTG ACGATTG AAACAGGCTGG C AGAA 
GAATG ACAC TGGCTACTGGTACGTACAT TCAGACGG CTCTT ATCCAAAAG AC AAGTTTGAG AAAATCAATGG CACTT 
GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCACACAGAGGGCAACTGGTACTGGTTC 
GACAACTCAGGCGAAATGGCTACAGGCTGGAAGAAAATCGCTGATAAGTGGTACTATTTCAACGAAGAAGGTGCCAT 

40 G AAGACAGGCTGGGTCAAGT ACAAGGACACTTGGT ACTACTT AG ACG CTAAAGAAGGCG CCATGGTATCAAATGCCT 

TTATCC AGTCAG CGGACGGAACAGGCTGGT ACT ACCTCAAAC CAGACGG AACAC TGGCAG ACAAGC CAG AATTCACA 
GTAGAGCCAGATGGCTTGATTACAGTAAAATAA 

MEINVSKLRTDLPQVGVQPYRQVHAHSTGNPHSTVQNEADYHWRKDPELGFFSHIVGNGCIMQVGPVDNGAWDVGGG 
45 WNAETYAAVELIESHSTKEEFmDYRLYIELLRNLADEAGLPCT^ 

AKWG I S REQ FKHD I ENGLT I ETG WQ KNDTG YWYVH SDGSYPKDKFEKI NGTW Y Y FDS S G YMLADRWRKH TDGNW YW F 

DNSGEMATGWKKIADKWYYFNEEGAMKTGWVKYKDTWYYL^^ 

VEPDGLITVKZ 

50 ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGCAGGTAT 
CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTGGTATGTTAC 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGCACGGAT 

55 AGA(^TCGATACGG<^TATGATTACTTGGATAAACGTTTTAATTCT 

TTATT ATTTATCAATTGGGACGTATGTCTAT C ATT ATGT ACCTC CC ATCAGCTGGTTT AT CAGT ATTGACAGG AATT 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGTATT 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACTGATT^ 
AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTC 

60 AACTTGCTTTCAAACTCCATCnTTTTAATTGTGATGGGTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATCTCA 
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AGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGTTCACAAACGGTGTTTTGT 
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTCTATCAAGTACAAAATGCAGATAGT 
GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCACAGGTTT 
GATCTTGG CAG CGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCTGTTGCAACTTCATGGACAT 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAAT CGT CGT ACGAAAATTG CACAATTCGTATCTCTAGCA 

GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATGGTTCAATAG 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATTTGTTTCTAAAAAAGCAAATAA^ 
GTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCT 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTAT 
1 0 AGTATCTGCACCTAAATATACAATOATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGTTCGTCACT 

AA 

MGTTG FTIIDLIILI V Y LLAVLVAG I Y FS KKEMKGKE FF KGDGS VPW YVT S VS I F ATMLS P I S F LG LAGS S Y AG S W I 
LWFAQLGMWAI PLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSI IMYLPSAGLSVLTGI 
15 DINILIILMGWAIVYSYTGGLKSVLWTDFIQGVILISGWIJUjFVL^ 

NLLSNS I FLIVMGSGFTILSSYASSQDLVQRFTTTQNI kklnkmlftngvlslatatvfyligtglyvfyqvqnads 
AASNIPQDQIFMYFIAYQLPVGITGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLA 
VGLFSIGVSIVMAHSDIKSAYEWFNSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVS 
YWAYS L I S I SVS WSGY I VSVLTGNKVS APKYTT I HD I TE I KADS S WEVRHZ 

20 

ID311 

ATGAAAATTAATAAAAAATATCTAGCAGGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTATGAGCTTGG 
TCAC CAAGCTGGTCAGG ATAAGAAAGAGTCTAATCGAGTTG CTT AT ATAG ATGGTG AT CAGGCTGGTCAAAAGG CAG 
AAAACTTGACACCAGATGAAGTC AGTAAG AGGGAGGGGATCAACG CCG AACAAATCGTC ATCAAGATTACGGAT CAA 

25 GGTTATGTGACCTCTCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCTTATGATGCCATCATCAGTGAAGA 
GCTCCT CATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGT CATCA 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGTC AGAAGCAGGAACGCAGTCATAATCACGGGTCAGGAGCTAACGATCATG CAG TAGCTGCAG CC AGAGCC CAAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 

30 CTCACGGCGACCATTACCATTAC^TTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGA 

AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTCAACCAAGATTGTCAGAGAA 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTC^ 

CTAAACCCTTATC AGAACG CCATGTGG AATCTG ATGG CCTT ATTTTCGACC CAGCG CAAATCAC AAGT CGAACCG CC 
AGAGGTGTAGCTGTCCCTCATGGTAACC^TTACCACTTTATCCCI^ATGAACAAATGTCTGAATTGGAAAAACGAAT 

35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAAT 
CGACT CCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAAC CAG CTCCAAGCAATCCAATTGATGAGAAA 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
CAAGGATCTTTCAGCAGAAACAGCAGCAGGCATTGATAGCAAACTGGCCAAGCAGGAAAGTTTATCTCATAAGCTAG 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 

40 CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AAC CAAATGCGCAAATTACCT ACACTG ATGATGAGATT CAAGTAGCCAAGTTGG CAGGCAAGTAC ACAACAG AAGAC 
GGTT AT ATCTTTGATC CTCGTGAT AT AACCAGTG ATGAGGGGG ATG CCTATGTAACTCCACATATG ACC CAT AGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 

45 CTCCTTCGACAG ACC ATCAGGATT CAGG AAAT ACTGAGGC AAAAGGAGCAGAAGCT ATCTACAAC CGCGTGAAAGCA 

GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAATCAT 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTATA 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCAT^ 

GGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATACCAATCAAACGGAAAAACCAAGCGAGGA 
50 GAAACCTCAGACAGAAAAAC CTGAGGAAGAAACCCCT CGAG AAGAG AAACCG CAAAGCGAGAAACCAG AGTCTCCAA 

AACC AACAG AGG AAC CAGAAGAATCACCAGAGGAATCAGAAGAACCTC AGGT CG AG ACTGAAAAGGTTGAAG AAAAA 
CTGAGAG AGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATT ATC AAGT CCAATG CCAAAGAGACTCTCACAGG 
ATTAAAAAAT AATTT ACT ATTTGG CACCCAGGACAACAAT ACTATTATGG C AGAAG CTG AAAAACT ATTGGCTTT AT 
TAAAGGAGAGTAAGTAA 

55 

MKINKKYLAGSVAVIjALSVCSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQ 
GYVTS HGDHYHY YNGKVPYDAI I SEELLMKDPNYQLKDSD I VNEI KGGYV I KVDGKYYVYLKDAAHADN I RTKEE I K 
RQKQERSHNHGSGANDHAVAAARAQGRYTTDDGYIFNASDI IEDTGDAYIVPHGDHYHYI PKNELSASELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLI FDPAQITSRTA 
60 RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK 
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LVKEAVRKVGDGYVFEENGVSRYI PAKDLSAETAAGIDSKLAKQES LSHKLGAKKTDLPSSDREFYNKAYDLLARIH 
QDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDIUVFI^^ 

GYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKA 
AKKVPIiDRMPYNLrQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGF 
5 GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 
LREAEDLLGKI QDP 1 1 KSNAKETLTGLKNNLL FGTQDNNTI MAEAE KLLALLKES KZ 

ID312 

ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACTTACTAC GCCT ATTCT 
10 TCAACAGCAAGTAAACAAATGGGCTGACAAGGCAAATAAAGGCGAAAAAGGGGCATTTGCTAACTACTCTTTGCTCC 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAATGATGTCATC 
GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCTGTCAAATACTTAGACAACAAAGAAT 

TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTCATGTTGTTCTGTATAAGACTTTATTGGCCACTG 
GCCGTATTAGTGAGGCTCTGGCTCTTGAATGGTCTGATATTGACCTAGAAAGCGGTGTTATCAGCATCAATAAGACA 

1 5 CTAAACCGCTATCAGGAAATAAACTCACCTAAATCAAGCGCTGGTTATCGTGATATACCAATAGACAAAGCCAC7VTT 
ACTTTT ACTGAAACAATACAAAAACCGTCAACAAATT CAGTCTTGGAAATTAGG CCG ATCTGAAACAGTTGTATTCT 
CTGTATTT ACGGAGAAATATGC TTATGCTTGTAACTT ACGCAAACG CCT AAATAAGCATTTTGATG CTG CTGGAGT A 
ACTAACGTATCATTTCATGGTTTCCGCCATACACATACTACTATGATGCTCTATGCTCAGGTTAGCCCGAAAGATGT 
TCAGTATAGATTAGGCCACTCTAATTTAATGATCACTGAAAATACTTACTGGCATACTAACCAAGAGAATGCAAAAA 

20 AAGCCGTCTCAAATTATGAAACAGCTATCAACAATTTATAA 

MEGLVRVHL L P VFGD YKLS KLTT P I LQQQ VNKW ADKAN KG E KGAF AN YS L LHNMN KR I L KYGV A I Q V I Q YN P ANDV I 
VPRKQQKEKAAVKYLDNKELKQFLDYLDALIX3SNYENLFDWLYKTLLATGCRISEAIJVLEWSDIDLESGV 
LNRYQEINSPKSSAGYRDIPIDKATLLLLKQYKNRQQIQSWKLGRSETWFSVFTEKYAYACNLRKRI^KHFDAAGV 
25 TNVSFHGFRHTHTT^LYAQVSPKDVQYRLGHSNLMITEOT 
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CLAIMS: 

1. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 2. 

2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 4. 

3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in 
substantially pure form. 

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3. 

5. A homologue or derivative of a protein or polypeptide as defined in any one of 
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as 
defined 

in Tables 2-4. 

7. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RN A equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 
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(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

5 (v) a sequence which codes for a homologue, derivative or fragment of a 

protein as defined in Table 1. 

.8. A nucleic acid molecule comprising or consisting of a sequence which is: 

10 (i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
15 sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

20 (v) a sequence which codes for a homologue, derivative or fragment of a 

protein as defined in Table 4. 

9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 

25 immunogen and/or antigen. 

10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables 2- 
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4, or homologues or derivatives thereof, and/or fragments of any of these. 

11. An immunogenic and/or antigenic composition as claimed in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 

12. A vaccine as claimed in claim 11 which comprises one or more additional 
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as 
defined in Tables 1, 3 or 4. 

14. A method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 

15. An antibody capable of binding to a protein or polypeptide as defined in 
Tables 2-4, or for a homologue, derivative or fragment thereof. 

16. An antibody as defined in claim 15 which is a monoclonal antibody. 

17. A method for the detection/diagnosis of S.pneumoniae which comprises the step 
of bringing into contact a sample to be tested and at least one antibody as define din 
claim 15 or claim 16. 

18. A method for the detection/diagnosis of S.pneumoniae which comprises the. 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as defined in claim 7 or claim 8. 
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19. A method of determining whether a protein or polypeptide as defined in 
Tables 2-4 represents a potential anti-microbial target which comprises inactivating 
said protein or polypeptide and determining whether S. pneumoniae is still viable in 
vitro or in vivo. 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S.pneumoniae infection 
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